OM nucleic - nucleic search, using sw model

Run on:         December 13, 2003, 04:37:24 ; Search time 1703.81 Seconds
                                           (without alignments)
                                           7460.491 Million cell updates/s



Title: US-09-852-261-5 
Perfect score: 523 

Sequence: 1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523 



Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0



Searched:       22781392 seqs, 12152238056 residues

Total number of hits satisfying chosen parameters:      45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
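
The throughput figure in the header can be cross-checked against the other reported values. The sketch below assumes that a "cell update" is one query-residue by database-residue comparison and that both strands of every database sequence were searched. The second assumption is suggested by the hit total (45562784) being exactly twice the number of sequences searched (22781392), but the report itself does not state it. The function name is illustrative only.

```python
def million_cell_updates_per_second(query_len, db_residues, seconds, strands=2):
    """Estimate dynamic-programming throughput in millions of cells per second."""
    # One cell per (query residue, database residue) pair, per strand searched.
    cells = query_len * db_residues * strands
    return cells / seconds / 1e6


# Values copied from the report header.
rate = million_cell_updates_per_second(523, 12152238056, 1703.81)
print(f"{rate:.1f} Million cell updates/s")
```

Under these assumptions the result lands within rounding distance of the reported 7460.491 Million cell updates/s; the small difference is compatible with the search time being printed to two decimals.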



Database :
                1:  em_estba:*
                2:  em_esthum:*
                3:  em_estin:*
                4:  em_estmu:*
                5:  em_estov:*
                6:  em_estpl:*
                7:  em_estro:*
                8:  em_htc:*
                9:  gb_est1:*
                10: gb_est2:*
                11: gb_htc:*
                12: gb_est3:*
                13: gb_est4:*
                14: gb_est5:*
                15: em_estfun:*
                16: em_estom:*
                17: em_gss_hum:*
                18: em_gss_inv:*
                19: em_gss_pln:*
                20: em_gss_vrt:*
                21: em_gss_fun:*
                22: em_gss_mam:*
                23: em_gss_mus:*
                24: em_gss_pro:*
                25: em_gss_rod:*
                26: em_gss_phg:*
                27: em_gss_vrl:*
                28: gb_gss1:*
                29: gb_gss2:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
RESULT 1
AI503976/C
LOCUS       AI503976                 558 bp    mRNA    linear   EST 11-MAR-1999
DEFINITION  vm43d08.x1 Stratagene mouse diaphragm (#937303) Mus musculus cDNA
            clone IMAGE:1001007 3' similar to gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AI503976
VERSION     AI503976.1  GI:4401827
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 558)
  AUTHORS   Marra,M., Hillier,L., Kucaba,T., Martin,J., Beck,C., Wylie,T.,
            Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers,Y., Person
            ,B., Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk,R., Ritter
            ,E., Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann,R.,
            Waterston,R. and Wilson,R.
  TITLE     The WashU-NCI Mouse EST Project 1999
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/WashU-NCI Mouse EST Project 1999
            Washington University School of Medicine
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL ; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:565223
            This clone was previously sequenced on the 5' end only, this new
            data is from the 3' end
            High quality sequence stop: 440.
FEATURES             Location/Qualifiers
     source          1..558
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:1001007"
                     /tissue_type="diaphragm"
                     /dev_stage="adult"
                     /lab_host="SOLR (kanamycin resistant)"
                     /clone_lib="Stratagene mouse diaphragm (#937303)"
                     /note="Organ: diaphragm; Vector: pBluescript SK-; Site_1:
                     EcoRI; Site_2: XhoI; Cloned unidirectionally from mRNA
                     prepared from diaphragm muscle. Primer: Oligo dT. Average
                     insert size: 1.5 kb. Uni-ZAP XR Vector; ~5' adaptor
                     sequence: 5' GAATTCGGCACGAG 3' ~3' adaptor sequence: 5'
                     CTCGAGTTTTTTTTTTTTTTTTTT 3'"
BASE COUNT      103 a    133 c    149 g    173 t
ORIGIN



  Query Match             69.8%;  Score 364.8;  DB 9;  Length 558;
  Best Local Similarity   83.2%;  Pred. No. 8.8e-81;
  Matches  441;  Conservative    0;  Mismatches   82;  Indels    7;  Gaps    2;
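
The Query Match percentages in this listing are consistent with the alignment score divided by the perfect score (523 for this query). That relationship is inferred from the numbers reported here rather than taken from documentation of the search program, and the helper name below is hypothetical.

```python
PERFECT_SCORE = 523  # "Perfect score" from the report header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Express an alignment score as a percentage of the perfect score."""
    return 100.0 * score / perfect_score


# (accession, Score, reported Query Match %) as printed for RESULT 1 to 3.
reported = [
    ("AI503976", 364.8, 69.8),
    ("AW146128", 363.0, 69.4),
    ("AI169253", 348.2, 66.6),
]
for accession, score, printed in reported:
    print(accession, f"{query_match_percent(score):.1f}", printed)
```

For each of the three results the computed value, rounded to one decimal, agrees with the percentage printed in the report.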



GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

| | | I I | | | | I || I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 471 

AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I | I I I I || I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 411 

ACAGGCAT C GTGGAT GAGT GCT GCT T C C GGAGCT GT GAT CT GAGGAGGCT GGAGAT GT AC 180 
| I M | I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I M I 
ACAGGCATT GTGGAT GAGT GTT GCTTCCGGAGCTGTGATCTGAGGAGACT GGAGATGTAC 351 

TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

| | | | | || || I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 2 91 

AT GCC CAAGACT CAGAAGT AT CAGC CT C CAT CT ACCAACAAGAAAAT GAAGT CT CAGAGG 300 

I I I I I I I II I I I I I I II I I I I I I I I I II I I I I I I I I I I I M I I I I I I 

AT GC CCAAGACT CAGAAGT CCC C GT C C CT AT C GACAAACAAGAAAAC GAAGCT GCAAAGG 231 

AGAAGGAAAGGAAGTAC AT TT GAAGAAC ACAAGTAGAGGGAGT GCAGGAAACAAGAACTA 360 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGAAGGAAAGGAAGTACATTT GAAGAACACAAGTAGAGGAAGT GCAGGAAACAAGACCTA 171 

CAGGAT GT AGGAAGAC C CTT CT GAGGAGT GAAGAAGGACAGGC CAC C GCAGGAC CCTT T G 420 

I I I M I I I I I I I I I I I I M I I I I I I I I I I I II I I M I I 

C AGAAT GT AGGAGGAGC CT C C CAC GGAGCAGAAAAT GC CAC AT CACC GCAGGAT CCT T T G 111 

CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

|| I I I I I I I I I I I I I Ml I I I I I I I II II 

CT GCTT GAGCAACCT GCAAAAC AT C GAAACAC CT AC CAAAT AACAAT AAT AAGT CCAAT A 5 1 

AC ATT T CAAAGAT - GGC ATT T C C CC CAAT GAAATAC ACAAGT AAACATT C 52 3 
| I M I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
ACATT ACAAAGAT GGGC ATT T C CCC CAAT GAAAT AT ACAAGT AAAC AT TC 1 



RESULT 2 
AW146128/C 

LOCUS       AW146128                 623 bp    mRNA    linear   EST 10-OCT-2000
DEFINITION  um37e10.x1 Sugano mouse embryo mewa Mus musculus cDNA clone
            IMAGE:2247498 3' similar to gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AW146128
VERSION     AW146128.1  GI:6167864
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 623)
  AUTHORS   Marra,M., Hillier,L., Kucaba,T., Martin,J., Beck,C., Wylie,T.,
            Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers,Y., Person
            ,B., Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk,R., Ritter
            ,E., Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann,R.,
            Waterston,R. and Wilson,R.
  TITLE     The WashU-NCI Mouse EST Project 1999
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/WashU-NCI Mouse EST Project 1999
            Washington University School of Medicine
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL ; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:1006958
            Seq primer: custom primer used
            High quality sequence stop: 499.
FEATURES             Location/Qualifiers
     source          1..623
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:2247498"
                     /dev_stage="embryo, 14 dpc"
                     /lab_host="DH10B"
                     /clone_lib="Sugano mouse embryo mewa"
                     /note="Vector: pME18S-FL3; Site_1: DraIII (CACTGTGTG);
                     Site_2: DraIII (CACCATGTG); 1st strand cDNA was primed
                     with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT];
                     double-stranded cDNA was ligated to a DraIII adaptor
                     [TGTTGGCCTACTGG], digested and cloned into distinct DraIII
                     sites of the pME18S-FL3 vector (5' site CACTGTGTG, 3' site
                     CACCATGTG). XhoI should be used to isolate the cDNA
                     insert. Size selection was performed to exclude fragments
                     <1.5kb. Library constructed by Dr. Sumio Sugano
                     (University of Tokyo Institute of Medical Science).
                     Custom primers for sequencing: 5' end primer
                     CTTCTGCTCTAAAAGCTGCG and 3' end primer
                     CGACCTGCAGCTCGAGCACA."
BASE COUNT      123 a    138 c    170 g    191 t      1 others
ORIGIN



  Query Match             69.4%;  Score 363;  DB 9;  Length 623;
  Best Local Similarity   81.9%;  Pred. No. 2.5e-80;
  Matches  433;  Conservative    0;  Mismatches   90;  Indels    6;  Gaps    1;



Qy          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||| ||||| || | ||| ||||||||||||||||||||||||||||||||||||
Db        541 GGACCAGAGACCCTTTTCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 482

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
              ||||||||||| ||||||||||||||||| || |||||||||| ||||||||||||||||
Db        481 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 422

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
              |||||||| ||||||||||| |||||||||||||||||||||||||| ||||| ||||||
Db        421 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAAATGTAC 362



Qy        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
              ||||| || || |||||  |||| |||||||||||  ||||||||||||||||||| |||
Db        361 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 302

Qy        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
              |||||||||||||||||||  | | | | ||| || |||||||||| ||||   || |||
Db        301 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 242

Qy        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
              |||||||||||||||||||||||||||| |||||||||| |||||||||||||||| |||
Db        241 AGAAGGAAAGGAAGTACATTTGAAGAACCCAAGTAGAGGAAGTGCAGGAAACAAGACCTA 182

Qy        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
              ||| |||||||| || ||| |   ||||   | || | ||   |||||||||| ||||||
Db        181 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 122

Qy        421
Db        121

Qy        475
Db         61



I II I I I I I I II III I I I I I I I I I 



I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I 



RESULT 3 

AI169253/C 

LOCUS       AI169253                 549 bp    mRNA    linear   EST 08-JAN-1999
DEFINITION  EST215088 Normalized rat kidney, Bento Soares Rattus sp. cDNA clone
            RKIBP33 3' end, mRNA sequence.
ACCESSION   AI169253
VERSION     AI169253.1  GI:4134375
KEYWORDS    EST.
SOURCE      Rattus sp.
  ORGANISM  Rattus sp.
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1  (bases 1 to 549)
  AUTHORS   Lee,N.H., Glodek,A., Chandra,I., Mason,T.M., Quackenbush,J.,
            Kerlavage,A.R. and Adams,M.D.
  TITLE     Rat Genome Project: Generation of a Rat EST (REST) Catalog & Rat
            Gene Index
  JOURNAL   Unpublished
COMMENT     On Oct 6, 1998 this sequence version replaced gi:3705561.
            Other_ESTs: TC50779
            Contact: Lee, NH
            The Institute for Genomic Research
            9712, Medical Center Drive, Rockville, MD 20850, USA
            Tel: (301)-838-3529
            Fax: (301)-838-0208
            Email: nhlee@tigr.org
            Seq primer: M13-21.
FEATURES             Location/Qualifiers
     source          1..549
                     /organism="Rattus sp."
                     /mol_type="mRNA"
                     /db_xref="taxon:10118"
                     /clone="RKIBP33"
                     /clone_lib="Normalized rat kidney, Bento Soares"
                     /note="Organ: kidney; Vector: pT7T3Pac; Site_1: EcoRI;
                     Site_2: NotI"
BASE COUNT      112 a    140 c    133 g    164 t
ORIGIN



  Query Match             66.6%;  Score 348.2;  DB 9;  Length 549;
  Best Local Similarity   81.8%;  Pred. No. 1.3e-76;
  Matches  428;  Conservative    0;  Mismatches   88;  Indels    7;  Gaps    2;



Qy 



Db 



8 AGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCT 67 

II I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
549 AGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAATTCGTGTGTGGACCAAGGGGCT 4 90 



Qy 



Db 



68 T TT AT T TCAACAAGC C C ACAGGAT ACGGCT C CAGC AGT C GGAGGGC ACCT CAGAC AGGCA 127 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I III MMIM I I I I I I I I I 
48 9 T TT ACT TCAACAAGC C C ACAGGCT AT G GCT CCAGC AT T C GAAGGGC AC CACAGAC GGGCA 430 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



12 8 TC GT GGAT GAGT GCT G CTT CC G GAGCT GT GAT CT GAGGAGGCT GGAGAT GTACT GT GCAC 187 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
42 9 TTGT GGAT GAGT GTAGCTT CCGGAGCT GT GATCT GAGGAGGCT GGAGAT GTACT GT GCTC 370 

18 8 CCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGACATGCCCA 247 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
369 CGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACATTGACATGCCCA 310 

24 8 AGACT CAGAAGT AT CAGC CT C CAT CT AC CAACAAGAAAAT GAAGTCT C AGAGGAGAAGGA 307 

I I I I I I I I I I I I I I II I I III II I I I I II I I I I I I I II I I I I I I I II I 
309 AGACT CAGAAGT CC CAGC C CCT AT C GACAC ACAAGAAAAGGAAGCT GCAAAGGAGAAGGA 2 50 

308 AAGGAAGTACATTT GAAGAACACAAGTAGAGGGAGT GCAGGAAACAAGAACTACAGGATG 367 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
249 AAGGAAGT AC ACTT GAAGAACACAAGT AGAGGAAGT GCAGGAAACAAGACTT ACAGAAT G 190 

368 T AGGAAGACCCTTCT GAGGAGT GAAGAAGGACAGG CCAC C GC AGGAC C CTTT GCT CT GCA 427 

I I I I I II III I I I I I I I II I II I I I I I I I I II I I I I I I I I 

18 9 TAGGAGGAGC CT CC CGAGGAACAGAAAAT GC CAC GT CAC C GCAAGAT CCTTTGCTGCTTG 130 

42 8 C AGT T AC CT GT AAAC AT T GGAAT AC C GG C C A AAAAATAAGTT T GAT CACATT T C 4 81 

I I I I I II I I I I I I I I I I I I I I II I I I I II I I I I I I 

12 9 AGCAACCT GCAAAACAT CGGAACACCT GCCAAATAT CAATAAT GAGTTCAATACCATTT C 70 

4 82 AAAGAT - GGCATTT CCCCCAATGAAATACACAAGTAAACATT C 523 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
69 AGAGAT GGGCATTT CCCTCAAT GAAATACACAAGTAAACATT C 27 



RESULT 4 
AI265629/C 

LOCUS       AI265629                 558 bp    mRNA    linear   EST 18-NOV-1998
DEFINITION  uj04b07.x1 Sugano mouse liver mlia Mus musculus cDNA clone
            IMAGE:1890901 3' similar to gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AI265629
VERSION     AI265629.1  GI:3883787
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 558)
  AUTHORS   Marra,M., Hillier,L., Allen,M., Bowles,M., Dietrich,N., Dubuque,T.,
            Geisel,S., Kucaba,T., Lacy,M., Le,M., Martin,J., Morris,M.,
            Schellenberg,K., Steptoe,M., Tan,F., Underwood,K., Moore,B.,
            Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson,R. and
            Waterston,R.
  TITLE     The WashU-HHMI Mouse EST Project
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/Mouse EST Project
            WashU-HHMI Mouse EST Project
            Washington University School of MedicineP
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL ; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:975225
            Seq primer: custom primer used
            High quality sequence stop: 495.
FEATURES             Location/Qualifiers
     source          1..558
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:1890901"
                     /sex="female"
                     /dev_stage="adult"
                     /lab_host="DH10B"
                     /clone_lib="Sugano mouse liver mlia"
                     /note="Organ: liver; Vector: pME18S-FL3; Site_1: DraIII
                     (CACTGTGTG); Site_2: DraIII (CACCATGTG); 1st strand cDNA
                     was primed with an oligo(dT) primer
                     [ATGTGGCCTTTTTTTTTTTTTTTTT]; double-stranded cDNA was
                     ligated to a DraIII adaptor [TGTTGGCCTACTGG], digested
                     and cloned into distinct DraIII sites of the pME18S-FL3
                     vector (5' site CACTGTGTG, 3' site CACCATGTG). XhoI should
                     be used to isolate the cDNA insert. Size selection was
                     performed to exclude fragments <1.5kb. Library
                     constructed by Dr. Sumio Sugano (University of Tokyo
                     Institute of Medical Science). Custom primers for
                     sequencing: 5' end primer CTTCTGCTCTAAAAGCTGCG and 3' end
                     primer CGACCTGCAGCTCGAGCACA."
BASE COUNT      106 a    135 c    156 g    161 t
ORIGIN



  Query Match             66.3%;  Score 347;  DB 9;  Length 558;
  Best Local Similarity   82.0%;  Pred. No. 2.5e-76;
  Matches  414;  Conservative    0;  Mismatches   85;  Indels    6;  Gaps    1;



Qy 


1 


Db 


506 


Qy 


61 


Db 


446 


Qy 


121 


Db 


386 


Qy 


181 


Db 


326 


Qy 


241 


Db 


266 


Qy 


301 


Db 


206 


Qy 


361 


Db 


146 


Qy 


421 


Db 


86 


Qy 


475 


Db 


26 



GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 
M | I I | | | | I || I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I 



447 



180 
327 



300 



| | I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Mill II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I III 



CAGGAT GTAGGAAGACCCTTCT GAGGAGT GAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 
II I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
CAGAAT GT AGGAGGAGCCT CC C AC GGAGCAGAAAAT GC CAC AT CAC C GCAGGAT CCTT T G 8 7 

CT CT GCACAGT T ACCT GT AAACAT TGGAAT ACC G GC C A AAAAATAAGTTT GATC 474 

|| I I I I I I I I II I I I III I I I I I I I I I II 



I I I I I I I I I I I I I I II I I I I 



RESULT 5 
CD373004 
LOCUS       CD373004                 614 bp    mRNA    linear   EST 29-MAY-2003
DEFINITION  UI-R-GR0-csv-j-17-0-UI.r1 UI-R-GR0 Rattus norvegicus cDNA clone
            UI-R-GR0-csv-j-17-0-UI 5', mRNA sequence.
ACCESSION   CD373004
VERSION     CD373004.1  GI:31157094
KEYWORDS    EST.
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1  (bases 1 to 614)
  AUTHORS   Bonaldo,M.F., Lennon,G. and Soares,M.B.
  TITLE     Normalization and subtraction: two approaches to facilitate gene
            discovery
  JOURNAL   Genome Res. 6 (9), 791-806 (1996)
  MEDLINE   97044477
   PUBMED   8889548
COMMENT     Contact: Soares, MB
            Coordinated Laboratory for Computational Genomics
            University of Iowa
            375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA
            Tel: 319 335 8250
            Fax: 319 335 9565
            Email: bento-soares@uiowa.edu
            Tissue Procurement: James Lin, University of Iowa
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
            Clone Distribution: Distribution information can be found at
            http://genome.uiowa.edu/distribution/rat.html
            Seq primer: M13 REVERSE.
FEATURES             Location/Qualifiers
     source          1..614
                     /organism="Rattus norvegicus"
                     /mol_type="mRNA"
                     /strain="Sprague-Dawley"
                     /db_xref="taxon:10116"
                     /clone="UI-R-GR0-csv-j-17-0-UI"
                     /tissue_type="Whole embryo"
                     /dev_stage="embryo 13dpc"
                     /lab_host="DH10B (Life Technologies) (T1 phage resistant)"
                     /clone_lib="UI-R-GR0"
                     /note="Vector: pYX-Asc; Site_1: EcoR I; Site_2: Not I;
                     UI-R-GR0 is a cDNA library containing the following
                     tissue(s): rat whole embryo 13dpc. The library was
                     constructed according to Bonaldo, Lennon and Soares,
                     Genome Research, 6:791-806, 1996. Denatured RNA was size
                     fractionated on a 1% agarose gel. First strand cDNA
                     synthesis was primed with oligo-dT primer containing a Not
                     I site. Double strand cDNA was size selected according to
                     mRNA size fraction, ligated with EcoR I adaptor, digested
                     with NotI and then cloned directionally into pYX-Asc
                     vector. The library tag sequence located between the Not I
                     site and the polyA tail is CATCTCTACT. This library was
                     created for the University of Iowa Program for Rat Gene
                     Discovery and Mapping (Val Sheffield, Bento Soares and Tom
                     Casavant)."
BASE COUNT      171 a    168 c    154 g    119 t      2 others
ORIGIN



  Query Match             64.9%;  Score 339.6;  DB 14;  Length 614;
  Best Local Similarity   81.4%;  Pred. No. 1.9e-74;
  Matches  393;  Conservative    0;  Mismatches   90;  Indels    0;  Gaps    0;
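
The Best Local Similarity values in this listing are consistent with the number of matches divided by the total number of alignment columns (matches plus mismatches plus indels). This is an inference from the reported counts, not a documented definition, and the function name is hypothetical. For an ungapped hit such as this one, the column total also equals the aligned query span (1 to 483, i.e. 483 positions).

```python
def best_local_similarity(matches, mismatches, indels):
    """Percent identity over all alignment columns, gap columns included."""
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


# Counts for RESULT 5 (CD373004), an alignment without indels.
print(f"{best_local_similarity(393, 90, 0):.1f}")  # report prints 81.4
# Counts for RESULT 1 (AI503976), an alignment with 7 indel columns.
print(f"{best_local_similarity(441, 82, 7):.1f}")  # report prints 83.2
```

Counting indel columns in the denominator is what reproduces the printed figures for the gapped results; leaving them out would give a slightly higher percentage than the report shows.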



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
116 GGAC CAGAGAC C CT TT GC GGGGCT GAGCT GGT GGAC GCT CT T CAGT T C GT GT GTGGAC CA 175 



Qy 



Db 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

M I I I I I I I I I I I I I I I I I I I I I II I I I II I I II I I I I I I I I I II I I I I I I I III 
176 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 235 



Qy 



121 ACAG GCAT C GT GGAT GAGT GCT GCT T C C GGAGCT GT GAT CT GAG GAGGCT GGAGAT GT AC 18 0 
II I I I I I II | | | I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 



Db 



236 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 2 95 



Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

I I I I I || M I I I I I I I I I I I I I I M Ml I I I I I I I I I I I I I I I I I I Ml 

Db 296 TGTGCTCCGCT GAAG C CTACAAAGT CAGCT CGTT C CAT C C GGGC C CAGC GC C ACACT GAC 355 

Qy 241 AT GC CCAAGACT C AGAAGT AT C AGCCT C CAT CT AC CAACAAGAAAAT GAAGT CT C AGAGG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 356 AT GCCCAAGACT CAGAAGT CCCAGCCCCT ATCGACACACAAGAAAAGGAAGCT GCAAAGG 415 

Qy 301 AGAAGGAAAGGAAGT ACAT T T GAAGAACACAAGT AGAGGGAGT GCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I Ml 
Db 416 AGAAGGAAAGGAAGT ACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 475 

Qy 361 CAGGAT GTAGGAAGAC CCTTCT GAGGAGT GAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

Ml I I I I I I I I II III I I I I I I I II II I I I I I I I I M I I I I I I 

Db 47 6 CAGAAT GTAGGAGGAGCCT CCCGAGGAACAGAAAATTCCACGTCAC CGCATGAT CCTTTG 535 

Qy 421 CT CT GCACAGT T AC CT GTAAAC AT T GGAAT ACCGGC CAAAAAATAAGT TT GAT CACATTT 4 80 

|| I I I I I I I II I II I I I I I I I I II IN Ml 

Db 536 CT GCTTGAGCAACCT GCANAACAT CGGAACACCTGCCAAATAT CAATAATGAGTTCAATA 595 

Qy 481 CAA 483 

I I 

Db 596 CCA 598 



RESULT 6 
AI119218 
LOCUS       AI119218                 816 bp    mRNA    linear   EST 02-SEP-1998
DEFINITION  ue94h02.y1 Sugano mouse embryo mewa Mus musculus cDNA clone
            IMAGE:1498803 5' similar to gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AI119218
VERSION     AI119218.1  GI:3519542
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 816)
  AUTHORS   Marra,M., Hillier,L., Allen,M., Bowles,M., Dietrich,N., Dubuque,T.,
            Geisel,S., Kucaba,T., Lacy,M., Le,M., Martin,J., Morris,M.,
            Schellenberg,K., Steptoe,M., Tan,F., Underwood,K., Moore,B.,
            Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson,R. and
            Waterston,R.
  TITLE     The WashU-HHMI Mouse EST Project
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/Mouse EST Project
            WashU-HHMI Mouse EST Project
            Washington University School of MedicineP
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL ; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:936407
            Seq primer: custom primer used
            High quality sequence stop: 473.
FEATURES             Location/Qualifiers
     source          1..816
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:1498803"
                     /dev_stage="embryo, 14 dpc"
                     /lab_host="DH10B"
                     /clone_lib="Sugano mouse embryo mewa"
                     /note="Vector: pME18S-FL3; Site_1: DraIII (CACTGTGTG);
                     Site_2: DraIII (CACCATGTG); 1st strand cDNA was primed
                     with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT];
                     double-stranded cDNA was ligated to a DraIII adaptor
                     [TGTTGGCCTACTGG], digested and cloned into distinct DraIII
                     sites of the pME18S-FL3 vector (5' site CACTGTGTG, 3' site
                     CACCATGTG). XhoI should be used to isolate the cDNA
                     insert. Size selection was performed to exclude fragments
                     <1.5kb. Library constructed by Dr. Sumio Sugano
                     (University of Tokyo Institute of Medical Science).
                     Custom primers for sequencing: 5' end primer
                     CTTCTGCTCTAAAAGCTGCG and 3' end primer
                     CGACCTGCAGCTCGAGCACA."
BASE COUNT      230 a    219 c    172 g    187 t      8 others
ORIGIN



  Query Match             64.9%;  Score 339.2;  DB 9;  Length 816;
  Best Local Similarity   81.2%;  Pred. No. 2.5e-74;
  Matches  389;  Conservative    0;  Mismatches   90;  Indels    0;  Gaps    0;



Qy 



Db 



323 



GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I II I I I M I 
GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 382 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II 

383 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 442 

121 ACAG GC AT CGT G GAT GAGT GCT G CTT C C GGAGCT GT GAT CT GAG GAG GCT GGAGAT GT AC 18 0 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

443 AC AGGC ATTGT GGAT GAGT GT T GCTT CC GGAGCT GT GAT CT GAGGAGACT GGAGAT GT AC 502 



181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 

I I I M || II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I Ml 
503 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 

241 AT GC C CAAGACT CAGAAGT AT CAGC CT C CAT CT ACCAACAAGAAAAT GAAGT CT C AGAGG 
I I I I I II I I I I I I I I I II I I I I I III II I I I I I I I II I I I I I M Ml 
563 AT GC CCAAGACT CAGAAGT CCCC GT CCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 



240 



562 



300 



622 



Qy 



Db 



301 AGAAGGAAAGGAAGT ACAT TT GAAGAAC ACAAGT AGAGGGAGT GCAGGAAACAAGAACT A 360 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

623 AGAAGGAAAGGAAGT AC AT TT GAAGAAC ACAAGT AGAGGAAGT GCAN GAAACAAGAC CT A 682 



361 CAGGAT GTAGGAAGACCCTT CT GAGGAGT GAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

683 CAGAAT GTANGAGGAG C CTN C CACGGAGC AGAAN AT GCCACAT CAC C GCAN GAT C CT TT G 742 

421 CTCTGCACAGTTACCT GTAAACATTGGAAT ACCGGCCAAAAAATAAGTTT GAT CACATT 479 

II I I I I I I I I I I I I I I I I I I I I III III 

743 CTGCTTGAGC^ACCTGCANAACATCGAAACACCTACCAAATAACATNTATAAGTCCAAT 801 



RESULT 7 
BF383724 
LOCUS       BF383724                 594 bp    mRNA    linear   EST 27-NOV-2000
DEFINITION  602044632F1 NCI_CGAP_Li9 Mus musculus cDNA clone IMAGE:4194295 5',
            mRNA sequence.
ACCESSION   BF383724
VERSION     BF383724.1  GI:11365029
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 594)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Jeffrey E. Green, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM9527 row: p column: 08
            High quality sequence stop: 589.
FEATURES             Location/Qualifiers
     source          1..594
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="FVB/N"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:4194295"
                     /lab_host="DH10B (T1 phage-resistant)"
                     /clone_lib="NCI_CGAP_Li9"
                     /note="Organ: liver; Vector: pCMV-SPORT6; Site_1: NotI;
                     Site_2: SalI; Cloned unidirectionally. Primer: Oligo dT.
                     Average insert size 1.9 kb. Constructed by Life
                     Technologies. Note: this is a NCI_CGAP Library."
BASE COUNT      175 a    162 c    142 g    115 t
ORIGIN



  Query Match             64.0%;  Score 334.8;  DB 10;  Length 594;
  Best Local Similarity   82.0%;  Pred. No. 2.9e-73;
  Matches  400;  Conservative    0;  Mismatches   82;  Indels    6;  Gaps    1;



Qy 16 TGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCTTTTATTTC 75 



I I I I I I I I I I I I I I I I I I I I I II I I I II I II I M I I I I I I I I I I I I I I I I I I III 

Db 107 TGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCGAGGGGCTTTTACTTC 166 

Qy 7 6 AACAAGCCC AC AGGAT ACGGCT C C AGCAGT CG GAGGGC AC CT CAGACAGGC AT C GTGGAT 135 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 167 AACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAGACAGGCATTGTGGAT 226 

Qy 136 GAGT G CT GCT T CC GGAGCT GT GAT CT GAG GAG GCT GGAGAT GT ACT GT G CAC C C CT CAAG 195 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II Ml 

Db 227 GAGT GTT GCTT CC GGAGCT GT GAT CT GAGGAGACT GGAGAT GT ACT GT GCC C CACT GAAG 28 6 

Qy 196 CCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGACATGCCCAAGACTCAG 255 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II 

Db 287 C CT ACAAAAG CAGCC C GCT CT AT C CGT GC C CAGC GCCACACT GACAT GC CCAAGACT CAG 34 6 

Qy 256 AAGT AT C AGC CT C CAT CT AC CAACAAGAAAAT GAAGT CT CAGAGGAGAAGGAAAGGAAGT 315 

I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 347 AAGT C CC C GT C C CT AT C GACAAACAAGAAAAC GAAGCT GCAAAGGAGAAGGAAAGGAAGT 406 

Qy 316 ACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTAGGAAGA 375 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I II 

Db 407 AC ATT TGAAGAACACAAGT AGAGGAAGT GC AGGAAACAAGAC CTACAGAAT GT AGGAGGA 466 

Qy 37 6 C C CT T CT GAGGAGT GAAGAAGGAC AGGC C ACC GC AGGAC C CTTT GCT CT GCAC AGTT AC C 435 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 67 GCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTGCTGCTTGAGCAACCT 52 6 

Qy 436 T GTAAACAT T GGAAT AC CGGC C A AAAAATAAGT T T GAT CACATTT CAAAGAT GG 48 9 

I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 527 GCAAAAC AT CGAAACAC CT AC CAAAT AACAAT AAT AAGT C CAATAACAT TACAAAGAT GG 586 

Qy 4 90 CATTTCCC 497 

I I I I 

Db 587 GCATTGCC 594 



RESULT 8 
CB959991 
LOCUS       CB959991                 796 bp    mRNA    linear   EST 29-APR-2003
DEFINITION  AGENCOURT_13888044 NIH_MGC_147 Homo sapiens cDNA clone
            IMAGE:30341081 5', mRNA sequence.
ACCESSION   CB959991
VERSION     CB959991.1  GI:30216107
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 796)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Stefan Hansson
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) with help
            and advice from Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: NDAM371 row: p column: 18
            High quality sequence stop: 707.
FEATURES             Location/Qualifiers
     source          1..796
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:30341081"
                     /tissue_type="Human Placenta"
                     /lab_host="DH10B TonA"
                     /clone_lib="NIH_MGC_147"
                     /note="Organ: placenta; Vector: pBluescriptR; Site_1:
                     all-XhoI; Site_2: BamH; Oligo-dT primed using primer
                     5'-TTTTTTTTTTTTTTTTVN-3', size-selected for average
                     insert size 2.3 kb and normalized to ROT 5. This is a
                     primary library enriched for full-length clones and
                     constructed using the Cap-trapper method (Carninci, in
                     preparation). Library constructed by M. Brownstein
                     (NIMH/NHGRI, National Institutes of Health). Note: This is
                     a NIH_MGC library."
BASE COUNT      224 a    197 c    191 g    184 t
ORIGIN



Query Match 63.9%; Score 334.4; DB 14; Length 796; 
Best Local Similarity 84.6%; Pred. No. 3.9e-73; 
Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 



Qy 


1 


Db 


180 


Qy 


61 


Db 


240 


Qy 


121 


Db 


300 


Qy 


181 


Db 


360 


Qy 


241 


Db 


420 


Qy 


301 


Db 


435 



GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 239 



AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I 
AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 



120 



299 



180 



AC AGGCAT C GT GGAT GAGT GCT GCT T C C GGAGCT GT GAT CT GAGGAGGCT GGAGAT GT AC 

I | | | | I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 359 

TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I I I I I I I I I I I I I II Ml I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 419 



I I I II I I I I I III 



AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 
I I I I I I I II I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I 
AAGGAAGTACATTT GAAGAACGCAAGTAGAGGGAGT GCAGGAAACAAGAACT A 



434 



360 



487 



Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 



Db 488 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 547 

Qy 421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

Db 548 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 607 

Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II I I I I I I III I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 608 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 653 



RESULT 9 
AW495481/c 
LOCUS       AW495481                 499 bp    mRNA    linear   EST 24-FEB-2000 
DEFINITION  UI-M-BH3-auy-g-11-0-UI.s1 NIH_BMAP_M_S4 Mus musculus cDNA clone 
            UI-M-BH3-auy-g-11-0-UI 3', mRNA sequence. 
ACCESSION   AW495481 
VERSION     AW495481.1  GI:7065762 
KEYWORDS    EST. 
SOURCE      Mus musculus (house mouse) 
  ORGANISM  Mus musculus 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE   1  (bases 1 to 499) 
  AUTHORS   Bonaldo,M.F., Lennon,G. and Soares,M.B. 
  TITLE     Normalization and subtraction: two approaches to facilitate gene 
            discovery 
  JOURNAL   Genome Res. 6 (9), 791-806 (1996) 
  MEDLINE   97044477 
   PUBMED   8889548 
COMMENT     Contact: Chin, H 
            National Institute of Mental Health 
            6001 Executive Blvd. Room 7N-7190, MSC 9643, Bethesda, MD 
            20892-9643, USA 
            Tel: 301 443 1706 
            Fax: 301 443 9890 
            Email: mEST@mail.nih.gov 
            The sequence contained an oligo-dT track that was present in the 
            oligonucleotide that was used to prime the synthesis of first 
            strand cDNA and therefore this may represent a bonafide poly A 
            tail. The sequence tag present in the cDNA between the NotI site 
            and the oligo-dT track served to identify it as a clone from the 
            normalized pineal glands library cDNA Library Preparation: M.B. 
            Soares Lab Clone distribution: Researchers may obtain BMAP cDNA 
            clones from RESEARCH GENETICS. It should be noted that Bento Soares 
            is generating a small number of additional specialized 
            non-redundant arrays of BMAP cDNAs whose availability will be 
            considered under appropriate and limited collaborative arrangements 
            Seq primer: M13 Forward 
            POLYA=Yes. 
FEATURES             Location/Qualifiers 
     source          1..499 
                     /organism="Mus musculus" 
                     /mol_type="mRNA" 
                     /strain="C57BL/6J" 
                     /db_xref="taxon:10090" 
                     /clone="UI-M-BH3-auy-g-11-0-UI" 
                     /dev_stage="27-32 days" 
                     /lab_host="DH10B (Life Technologies)" 
                     /clone_lib="NIH_BMAP_M_S4" 
                     /note="Vector: pT7T3D-Pac (Pharmacia) with a modified 
                     polylinker; Site_1: Not I; Site_2: Eco RI; The 
                     NIH_BMAP_M_S4 library is a subtracted library of a series, 
                     ultimately derived from a mixture of individually tagged 
                     normalized libraries from ten regions of the mouse brain 
                     (cerebellum, brain stems, olfactory bulbs, hypothalamus, 
                     cortex, amygdala, basal ganglia, pineal gland, striatum, 
                     hipoccampus) after a series of subtractions to reduce the 
                     representation of cDNAs from which ESTs had already been 
                     generated. The following serially subtracted libraries 
                     were generated in this process: NIH_BMAP_M_S4, 
                     NIH_BMAP_M_S3.3, NIH_BMAP_M_S3.2, NIH_BMAP_M_S3.1, 
                     NIH_BMAP_M_S2, NIH_BMAP_M_S1. The subtracted library 
                     (NIH_BMAP_M_S4) was constructed as follows: PCR amplified 
                     cDNA inserts from NIH_BMAP_M_S3.3, NIH_BMAP_M_S3.2, and 
                     NIH_BMAP_M_S3.1 clones from which 3' ESTs had been derived 
                     was used as a driver in a hybridization with a pool of 
                     the NIH_BMAP_M_S3.3, NIH_BMAP_M_S3.2, and NIH_BMAP_M_S3.1 
                     libraries in the form of single-stranded circles. The 
                     remaining single-stranded circles (subtracted library) 
                     was purified by hydroxyapatite column chromatography, 
                     converted to double-stranded circles and electroporated 
                     into DH10B bacteria (Life Technologies) to generate the 
                     NIH_BMAP_M_S4 library. This procedure has been previously 
                     described (Bonaldo, Lennon and Soares, Genome Research 
                     6:791-806, 1996) 
                     TAG_LIB=NIH_BMAP_M_S4 
                     TAG_TISSUE=pineal-glands 
                     TAG_SEQ=CAGAC" 
BASE COUNT       86 a    112 c    124 g    177 t 
ORIGIN 



Query Match 61.6%; Score 322.2; DB 9; Length 499; 
Best Local Similarity 82.3%; Pred. No. 4e-70; 
Matches 396; Conservative 0; Mismatches 78; Indels 7; Gaps 2; 



Qy 



Db 



50 T GT GT GGAGAC AG GGGCT T TT AT TT CAACAAG CC CACAGGAT ACGGCT CC AGC AGT C GGA 109 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
499 TGTGTGGACCGAGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGA 440 



Qy 

Db 

Qy 

Db 

Qy 
Db 



110 GGGC AC CT CAGACAGGCAT C GT GGAT GAGT GCT GCTT CC GGAGCTGT GAT CTGAGGAG GC 169 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
439 G GGCAC CT CAGACAGGCAT T GT GGAT GAGT GT T GCTT CC GGAGCT GT GAT CTGAGGAGAC 38 0 

170 TGGAGATGTACTGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGC 22 9 

I I I I I I I I I II I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
379 TGGAGATGTACTGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGC 320 

230 GCCACACCGACAT GCCCAAGACTCAGAAGTAT CAGCCTCCATCTACCAACAAGAAAATGA 2 8 9 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II II I I I I II I I I I II 
319 G C C AC ACT GACAT GC C CAAGACT C AGAAGT C C C C GT C C CT AT C GACAAACAAGAAAAC GA 260 



Qy 



290 AGT CT CAGAGGAGAAGGAAAGGAAGTACATTT GAAGAACACAAGTAGAGGGAGT GCAGGA 34 9 



Db 



259 



200 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



350 AACAAGAACTACAGGAT GTAGGAAGACCCTT CT GAGGAGT GAAGAAGGACAGGCCACCGC 409 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
199 AACAAGAC CT AC AGAAT GT AGGAGGAGC CT C C CACGGAGC AGAAAAT GC CACAT CAC C GC 140 

410 AGGAC CCT TT GCT CT GCACAGT T ACCT GTAAACATT GGAAT AC C GGC CA AAAAA 463 

I I I I I I I I I I I I I I II I I I I I I I I I III I I I I 

139 AGGAT CCT TT G CT GCTT GAGCAACCT GCAAAACAT C GAAAC AC CT AC CAAATAACAATAA 80 



4 64 TAAGT TT GAT CACAT TT CAAAGAT - GGCATTT CCCCCAAT GAAATACACAAGTAAACATT 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 9 TAAGT CCAATAACAT T ACAAAGAT GGGC ATTT C C C C CAAT GAAAT ATACAAGT AAAC ATT 

523 C 523 
I 

19 C 19 



522 



20 



RESULT 10 
AI876493/c 
LOCUS       AI876493                 642 bp    mRNA    linear   EST 21-JUL-1999 
DEFINITION  uj59b10.x1 Sugano mouse liver mlia Mus musculus cDNA clone 
            IMAGE:1924219 3' similar to gb:X57025_rna1 INSULIN-LIKE GROWTH 
            FACTOR IA PRECURSOR (HUMAN); gb:X04482 Mouse mRNA for 
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence. 
ACCESSION   AI876493 
VERSION     AI876493.1  GI:5550542 
KEYWORDS    EST. 
SOURCE      Mus musculus (house mouse) 
  ORGANISM  Mus musculus 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE   1  (bases 1 to 642) 
  AUTHORS   Marra,M., Hillier,L., Kucaba,T., Martin,J., Beck,C., Wylie,T., 
            Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers,Y., Person 
            ,B., Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk,R., Ritter 
            ,E., Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann,R., 
            Waterston,R. and Wilson,R. 
  TITLE     The WashU-NCI Mouse EST Project 1999 
  JOURNAL   Unpublished 
COMMENT     Contact: Marra M/WashU-NCI Mouse EST Project 1999 
            Washington University School of Medicine 
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 
            Tel: 314 286 1800 
            Fax: 314 286 1810 
            Email: mouseest@watson.wustl.edu 
            This clone is available royalty-free through LLNL; contact the 
            IMAGE Consortium (info@image.llnl.gov) for further information. 
            MGI:980511 
            Seq primer: custom primer used 
            High quality sequence stop: 257. 
FEATURES             Location/Qualifiers 
     source          1..642 
                     /organism="Mus musculus" 
                     /mol_type="mRNA" 
                     /strain="C57BL" 
                     /db_xref="taxon:10090" 
                     /clone="IMAGE:1924219" 
                     /sex="female" 
                     /dev_stage="adult" 
                     /lab_host="DH10B" 
                     /clone_lib="Sugano mouse liver mlia" 
                     /note="Organ: liver; Vector: pME18S-FL3; Site_1: DraIII 
                     (CACTGTGTG); Site_2: DraIII (CACCATGTG); 1st strand cDNA 
                     was primed with an oligo(dT) primer 
                     [ATGTGGCCTTTTTTTTTTTTTTTTT]; double-stranded cDNA was 
                     ligated to a DraIII adaptor [TGTTGGCCTACTGG], digested 
                     and cloned into distinct DraIII sites of the pME18S-FL3 
                     vector (5' site CACTGTGTG, 3' site CACCATGTG). XhoI should 
                     be used to isolate the cDNA insert. Size selection was 
                     performed to exclude fragments <1.5kb. Library 
                     constructed by Dr. Sumio Sugano (University of Tokyo 
                     Institute of Medical Science). Custom primers for 
                     sequencing: 5' end primer CTTCTGCTCTAAAAGCTGCG and 3' end 
                     primer CGACCTGCAGCTCGAGCACA." 
BASE COUNT      127 a    154 c    175 g    185 t      1 others 
ORIGIN 



Query Match 61.3%; Score 320.8; DB 9; Length 642; 
Best Local Similarity 80.1%; Pred. No. 9.6e-70; 
Matches 403; Conservative 0; Mismatches 93; Indels 7; Gaps 2; 



QY 



Db 



503 



GACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACA 61 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGGTCGTGTGTGGACCGA 444 



QY 
Db 

Qy 

Db 



62 



443 



122 



383 



GGGGCT TT TAT T T CAACAAGC CCAC AGGAT ACGGCT C C AGCAGT CGGAGGGC AC CT C AGA 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I 
GGGGCTTTTTCTT CAACAAGGC CACAG GCT AT GGCT C C AGCAT TT GGAGGGCAC CT C AGA 



121 



384 



180 



CAGGCAT CGTGGAT GAGTGCTGCTTCCGG- AGCT GT GATCT GAGGAGGCTGGAGAT GT AC 
I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I II III I I I I I I I I 
C AGT CAAT GT GGAT GAGT GTT GCT T C C GGAAGCT GT GAT CT GAGAAGACT GN AGAT GT AC 324 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



181 



323 



241 



263 



301 



203 



361 



143 



TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 

I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

T GT GCC C C ACT GAAGC CT ACAAAAGCAGC CC GCT CT AT C CGT GC CCAG C GC CACACT GAC 

AT GC CCAAGACT CAGAAGT AT CAGCCT C CAT CT AC CAACAAGAAAAT GAAGT CT CAGAGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGCCCAAGACT CAGAAGT CCCCGTCC CTAT C GACAAACAAGAAAACGAAGCT GCAAAGG 



240 



264 



300 



204 



360 



AGAAGGAAAGGAAGTAC AT TT GAAGAACACAAGTAGAGGGAGT G CAGGAAACAAGAACT A 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
AGAAGGAAAGGAAGTACATTT GAAGAACACAAGTAGAGGAAGT GCAGGAAACAAGACCTA 144 



CAGGAT GTAGGAAGAC C CT T CT GAGGAGT GAAGAAGGACAGGC CAC C G CAGGACC CT TTG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

CAGAAT GTAGGAGGAGC CT C C CAC G GAGCAGAAAAT GC CACAT CAC C GCAGGAT CCT T T G 



421 CT CT GCACAGT T ACCT GT AAAC AT T GGAATAC CGGC CA- 
II I I I I I I I I I I I I I III 



- AAAAATAAGTTT GAT C 
II I I I I I I I II 



420 



84 



474 



Db 83 CT GCTT GAGCAACCTGCAAAACAT CGAAACACCTACCAAATAACAATAATAAGT CCAATA 24 



Qy 



Db 



475 ACATTT CAAAGAT GGCATTT C CC 497 

I I I I I I I I I I I I I I II II 
23 ACATTACAAAGATGGGCATTT CC 1 



RESULT 11 
BM984670/c 
LOCUS       BM984670                 673 bp    mRNA    linear   EST 20-FEB-2003 
DEFINITION  UI-CF-EC1-abj-k-24-0-UI.s1 UI-CF-EC1 Homo sapiens cDNA clone 
            UI-CF-EC1-abj-k-24-0-UI 3', mRNA sequence. 
ACCESSION   BM984670 
VERSION     BM984670.1  GI:19610417 
KEYWORDS    EST. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1  (bases 1 to 673) 
  AUTHORS   Bonaldo,M.F., Lennon,G. and Soares,M.B. 
  TITLE     Normalization and subtraction: two approaches to facilitate gene 
            discovery 
  JOURNAL   Genome Res. 6 (9), 791-806 (1996) 
  MEDLINE   97044477 
   PUBMED   8889548 
COMMENT     Contact: McCray, PB 
            McCray Lab 
            University of Iowa 
            2024 University of Iowa Med Labs, Iowa City, IA 52242, USA 
            Tel: 319 356 4866 
            Fax: 319 356 7171 
            Email: paul-mccray@uiowa.edu 
            Tissue Procurement: Dr. M. J. Welsh, University of Iowa 
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
            Clone Distribution: Researchers may obtain clones from Research 
            Genetics (www.resgen.com) or from Open Biosystems 
            (www.openbiosystems.com). 
            Seq primer: M13 FORWARD 
            POLYA=Yes. 
FEATURES             Location/Qualifiers 
     source          1..673 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /clone="UI-CF-EC1-abj-k-24-0-UI" 
                     /tissue_type="Lung" 
                     /dev_stage="Adult and Fetal" 
                     /lab_host="DH10B (Life Technologies) (T1 phage resistant)" 
                     /clone_lib="UI-CF-EC1" 
                     /note="Organ: Lung; Vector: pT7T3-Pac (Pharmacia) with a 
                     modified polylinker; Site_1: EcoR I; Site_2: Not I; 
                     UI-CF-EC1 is a normalized cDNA library containing the 
                     following tissue(s): Normal lung from adult and from fetal 
                     day 64, day 87, week 19 and week 42. The library was 
                     constructed according to Bonaldo, Lennon and Soares, 
                     Genome Research, 6:791-806, 1996. First strand cDNA 
                     synthesis was primed with an oligo-dT primer containing a 
                     Not I site. Double stranded cDNA was ligated to an EcoR I 
                     adaptor, digested with Not I, and cloned directionally 
                     into pT7T3-Pac vector. The oligonucleotide used to prime 
                     the synthesis of first-strand cDNA contains a library tag 
                     sequence that is located between the Not I site and the 
                     (dT)18 tail. The sequence tag for this library is 
                     AAGTGCTTAC. 
                     TAG_LIB=UI-CF-EC1 
                     TAG_TISSUE=Normal Lung Epithelial Cells Tissue nos 369-371 
                     and 380-383 
                     TAG_SEQ=AAGTGCTTAC" 
BASE COUNT      152 a    164 c    169 g    188 t 
ORIGIN 



Query Match 61.3%; Score 320.8; DB 12; Length 673; 
Best Local Similarity 84.2%; Pred. No. 9.7e-70; 
Matches 443; Conservative 0; Mismatches 27; Indels 56; Gaps 5; 



Qy 


1 


Db 


492 


Qy 


61 


DO 




Qy 


121 


Db 


373 


Qy 


181 


Db 


313 


Qy 


241 


Db 


253 


Qy 


301 


Db 


238 


Qy 


361 


Db 


185 


Qy 


421 


Db 


125 


Qy 


479 


Db 


65 



GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 433 

AGGGGCT T T T AT TT CAACAAGCC CACAGGAT AC G GCT C CAGC AGT CGGAG GGCAC CT CAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I II I 

AGGGG-TTTTATTTCAGCAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 374 

ACAGGC AT C GT GGAT GAGT GCT G CTT CC GGAGCT GT GAT CT GAGGAGGCT GGAGAT GT AC 180 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
ACAGGCAT CGT GGAT GAGT GCT GCTTCCGGAGCTGTGAT CTAAGGAGGCT GGAGAT GT AT 314 

TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II I I I I I I M I I I I I I II III I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 254 

ATGCCCAAGACTCAGAAGTATCAGCCTCCAT CTACCAACAAGAAAATGAAGTCT CAGAGG 300 
I I I I I I I I I I I III 

AT GCC CAAGAC C CAG 239 

AGAAGGAAAGGAAGTACATTT GAAGAACACAAGTAGAGGGAGT GCAGGAAACAAGAACT A 360 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAGGAAGT AC AT T T GAAGAAC GCAAGTAGAGGGAGT GCAGGAAACAAGAACT A 186 

CAGGAT GT AGGAAGAC C CT T CT GAGGAGT GAAGAAGGACAGGC CAC CGC AGGAC C CTT T G 42 0 

I | M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAGGAT GT AGGAAGAC C CT C CT GAGGAGT GAAGAGT GACAT GC C AC CGC AGGAT C CT T TG 126 

CT CT GC AC - AGT T ACCT G- TAAAC AT T GGAAT AC CGGCCAAAAAATAAGT TT GAT C ACAT 47 8 

M I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

CT CT G CAC GAGTT AC CT GT T AAACT TT GGAACAC CT AC CAAAAAAT AAGT T T GATAACAT 6 6 

TT CAAAGAT- GGCATTT CCCCCAAT GAAAT ACACAAGTAAACATT C 523 

II I I I I I I III I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I 

TT AAAAGAT GGGC GTT T C C C C CAAT GAAAT ACACAAGTAAACATT C 2 0 



RESULT 12 
AI248089/c 
LOCUS       AI248089                 575 bp    mRNA    linear   EST 01-DEC-1998 
DEFINITION  qh69f05.x1 Soares_fetal_liver_spleen_1NFLS_S1 Homo sapiens cDNA 
            clone IMAGE:1849953 3' similar to gb:X57025_rna1 INSULIN-LIKE 
            GROWTH FACTOR IA PRECURSOR (HUMAN);, mRNA sequence. 
ACCESSION   AI248089 
VERSION     AI248089.1  GI:3843486 
KEYWORDS    EST. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1  (bases 1 to 575) 
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
            Tumor Gene Index 
  JOURNAL   Unpublished 
COMMENT     Contact: Robert Strausberg, Ph.D. 
            Email: cgapbs-r@mail.nih.gov 
            This clone is available royalty-free through LLNL; contact the 
            IMAGE Consortium (info@image.llnl.gov) for further information. 
            Insert Length: 918 Std Error: 0.00 
            Seq primer: -40UP from Gibco 
            High quality sequence stop: 380. 
FEATURES             Location/Qualifiers 
     source          1..575 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /clone="IMAGE:1849953" 
                     /sex="male" 
                     /dev_stage="20 week-post conception fetus" 
                     /lab_host="DH10B (ampicillin resistant)" 
                     /clone_lib="Soares_fetal_liver_spleen_1NFLS_S1" 
                     /note="Organ: Liver and Spleen; Vector: pT7T3D (Pharmacia) 
                     with a modified polylinker; Site_1: Pac I; Site_2: Eco RI; 
                     This is a subtracted version of the original Soares fetal 
                     liver spleen 1NFLS library. 1st strand cDNA was primed 
                     with a Pac I - oligo(dT) primer [5' 
                     AACTGGAAGAATTAATTAAAGATCTTTTTTTTTTTTTTTTTTT 3'], 
                     double-stranded cDNA was ligated to Eco RI adaptors 
                     (Pharmacia), digested with Pac I and cloned into the Pac I 
                     and Eco RI sites of the modified pT7T3 vector. Library 
                     went through one round of normalization. Library 
                     constructed by Bento Soares and M.Fatima Bonaldo." 
BASE COUNT      135 a    152 c    131 g    156 t      1 others 
ORIGIN 



Query Match 60.6%; Score 316.8; DB 9; Length 575; 
Best Local Similarity 83.8%; Pred. No. 9.4e-69; 
Matches 428; Conservative 0; Mismatches 28; Indels 55; Gaps 4; 



Qy 



Db 



16 TGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCTTTTATTTC 75 

I I I I I I I I I I I I I I II I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I 
551 TGCGGGGCTGAGCTGGTGNATGCTCTTCAGTTCGTGTGTGAAGACAGGGGCTTTTATTTC 492 



Qy 76 AACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAGACAGGCATCGTGGAT 135 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 491 AACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAGACAGGCATCGTGGAT 432 

Qy 136 GAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGCACCCCTCAAG 195 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 431 GAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTATTGCGCACCCCTCAAG 372 

Qy 196 CCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGACATGCCCAAGACTCAG 255 

II II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 371 CCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGACATGCCCAAGACCCAG 312 

Qy 256 AAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGGAGAAGGAAAGGAAGT 315 

I I I I I I I I 

Db 311 AAGGAAGT 304 

Qy 316 ACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTAGGAAGA 375 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 303 ACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTAGGAAGA 244 

Qy 376 CCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCTGCAC-AGTTAC 434 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 243 CCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTCTGCACGAGTTAC 184 

Qy 435 CTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAAGAT-GGCAT 492 

III I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I II I I I I I I I I I III I 

Db 183 CTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTAAAAGATGGGCGT 124 

Qy 493 TTCCCCCAATGAAATACACAAGTAAACATTC 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 123 TTCCCCCAATGAAATACACAAGTAAACATTC 93 



RESULT 13 
AI169770/c 
LOCUS       AI169770                 468 bp    mRNA    linear   EST 20-JAN-1999 
DEFINITION  EST215669 Normalized rat liver, Bento Soares Rattus sp. cDNA clone 
            RLIAT07 3' end, mRNA sequence. 
ACCESSION   AI169770 
VERSION     AI169770.1  GI:3709810 
KEYWORDS    EST. 
SOURCE      Rattus sp. 
  ORGANISM  Rattus sp. 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; 
            Rattus. 
REFERENCE   1  (bases 1 to 468) 
  AUTHORS   Lee,N.H., Glodek,A., Chandra,I., Mason,T.M., Quackenbush,J., 
            Kerlavage,A.R. and Adams,M.D. 
  TITLE     Rat Genome Project: Generation of a Rat EST (REST) Catalog & Rat 
            Gene Index 
  JOURNAL   Unpublished 
COMMENT     Other_ESTs: TC50779 
            Contact: Lee, NH 
            The Institute for Genomic Research 
            9712, Medical Center Drive, Rockville, MD 20850, USA 
            Tel: (301)-838-3529 
            Fax: (301)-838-0208 
            Email: nhlee@tigr.org 
            Seq primer: M13-21. 
FEATURES             Location/Qualifiers 
     source          1..468 
                     /organism="Rattus sp." 
                     /mol_type="mRNA" 
                     /db_xref="ATCC(inhost):2027570" 
                     /db_xref="taxon:10118" 
                     /clone="RLIAT07" 
                     /clone_lib="Normalized rat liver, Bento Soares" 
                     /note="Organ: liver; Vector: pT7T3Pac; Site_1: EcoRI; 
                     Site_2: NotI" 
BASE COUNT       85 a    115 c    119 g    149 t 
ORIGIN 



Query Match 59.1%; Score 309.2; DB 9; Length 468; 
Best Local Similarity 81.8%; Pred. No. 7.2e-67; 
Matches 383; Conservative 0; Mismatches 78; Indels 7; Gaps 2; 



Qy 



Db 



63 GGGCTTTT AT T T CAACAAGC C CACAGGAT AC GGCT C C AGCAGT CG GAG GGC AC CT CAGAC 122 
I I I I I II I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I I I I I M 
4 68 G GGCT T T TACT T CAACAAGC C CAC AG GCT AT GGCT CCAGCAT T C GGAGGGCAC CACAGAC 409 



Qy 

Db 

Qy 

Db 



123 



408 



183 



348 



AGGCAT C GT GGAT GAGTGCT GCT T CC GGAGCT GT GAT CT GAGGAGGCT GGAGAT GTACT G 

Mill I I I I I II I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I 

GGGC AT T GT G GAT GAGT GT T GCT C CC GGAGCT GT GAT CT GAG GAG GT T GGAGAT GTACT G 



182 



349 



242 



TGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGACAT 

Ml II II I I I I I I I I I I MM M III MM II II I II II I I I I I I I I II 

TGCTCCGCT GAAGC CTACAAAGT CAGCT C GT T CC AT C C GGGC C C AGC GCCACACT GACAT 28 9 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



243 



288 



303 



228 



363 



168 



423 



108 



GCC CAAGACT CAGAAGT AT CAGC CT C CAT CT AC CAACAAGAAAAT GAAGT CT CAGAGGAG 

I I I I I I I II I II I I II I II I II I Ml M I II II I II I II II II Mill 

GCCCAAGACT CAGAAGTCCCAGCCCCTAT CGACACACAAGAAAAGGAAGCT GCAAAGGAG 



302 



229 



362 



AAGGAAAGGAAGT ACAT T T GAAGAACACAAGTAGAGGGAGT GC AGGAAACAAGAACTAC A 

II I II M I II I I II II I I M I II I II I I II II I I I I II I I I I I I II I I II II II II I 

AAGGAAAGGAAGT ACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTACA 169 
GGAT GT AGGAAGAC CCTT CT GAGGAGT GAAGAAGGAC AGGC CAC C GCAGGAC C CT TT GCT 422 

I I I II I I I I I II I I I II I II I I II M II II I I II I I M 

GAAT GTAGGAGGAGCCT C C C GAGGAACAGAAAAT G C C ACGT CACC GCAAGAT C CTTT GCT 109 

CT GCACAGT T AC CT GTAAACAT T GGAAT AC CG GCC A AAAAAT AAGT T T GAT CAC 476 

I I I I II I I II I I I I I I II II I II II I I II I 

GCT T GAGCAAC CT GCAAAAC AT CGGAAC AC CT GC CAAATAT CAAT AAT GAGT T CAAT AC C 4 9 



Qy 



Db 



477 ATTT CAAAGAT - GGCAT T T CC C CCAAT GAAAT AC ACAAGT AAAC AT T C 523 
I I I I II MM I I II I I I I II I II I I I II I II I I II I I II I I I I I I 
48 AT T T CAGAGAT GGGCATTT C C CT CAAT GAAAT AC ACAAGT AAAC AT T C 1 



RESULT 14 
AA542914/c 
LOCUS       AA542914                 498 bp    mRNA    linear   EST 19-AUG-1997 
DEFINITION  ni98c10.s1 NCI_CGAP_Pr21 Homo sapiens cDNA clone IMAGE:984882 3' 
            similar to gb:X57025_rna1 INSULIN-LIKE GROWTH FACTOR IA PRECURSOR 
            (HUMAN);, mRNA sequence. 
ACCESSION   AA542914 
VERSION     AA542914.1  GI:2291394 
KEYWORDS    EST. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1  (bases 1 to 498) 
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
            Tumor Gene Index 
  JOURNAL   Unpublished 
COMMENT     Contact: Robert Strausberg, Ph.D. 
            Email: cgapbs-r@mail.nih.gov 
            Tissue Procurement: Michael J. Brownstein, M.D., Ph.D., Michael R. 
            Emmert-Buck, M.D., Ph.D. 
            cDNA Library Preparation: M. Bento Soares, Ph.D. 
            cDNA Library Arrayed by: Greg Lennon, Ph.D. 
            DNA Sequencing by: Washington University Genome Sequencing Center 
            Clone distribution: NCI-CGAP clone distribution information can be 
            found through the I.M.A.G.E. Consortium/LLNL at: 
            www-bio.llnl.gov/bbrp/image/image.html 
            Insert Length: 603 Std Error: 0.00 
            Seq primer: -40m13 fwd. ET from Amersham 
            High quality sequence stop: 412. 
FEATURES             Location/Qualifiers 
     source          1..498 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /clone="IMAGE:984882" 
                     /sex="male" 
                     /tissue_type="normal prostate" 
                     /lab_host="DH10B" 
                     /clone_lib="NCI_CGAP_Pr21" 
                     /note="Organ: prostate; Vector: pT7T3D-Pac (Pharmacia) 
                     with a modified polylinker; 1st strand cDNA was prepared 
                     from normal prostate bulk tissue, and was then primed with 
                     a Not I - oligo(dT) primer. Double-stranded cDNA was 
                     ligated to Eco RI adaptors (Pharmacia), digested with Not 
                     I and cloned into the Not I and Eco RI sites of the 
                     modified pT7T3 vector. Library is not normalized. Library 
                     was constructed by Bento Soares and M. Fatima Bonaldo." 
BASE COUNT      105 a    135 c    123 g    135 t 
ORIGIN 



Query Match 58.3%; Score 305; DB 9; Length 498; 

Best Local Similarity 83.5%; Pred. No. 8.2e-66; 

Matches 440; Conservative 0; Mismatches 30; Indels 57; Gaps 6; 

Qy 1 GGACCGGAGACGCTCTGCGGTGC-TGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGA 59 

I I I M I I I I I I I I I I I M I I II II I I II I I I M II I I I I I I I I II I I I I I I I I I 

Db 476 GGACCGGAGAACTTTTGCGGGGCTTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGA 417 



Qy 60 CAGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCA 119 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I II I I I I I I I I 
Db 416 CAGGGGC-TTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCA 358 

Qy 120 GACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTA 179 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I M II I I 
Db 357 GACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTA 298 

Qy 180 CTGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGA 239 

II I I I I I I II I I I I I I II III I I I I I I 11 I I I I I I I I I I I II I I I I I I I I I I I 

Db 297 TTGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGA 238 

Qy 240 CATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAG 299 

I I I I I I I I I I I I III 

Db 237 CAT GCC C AAGAC C C AG 222 

Qy 300 GAGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACT 359 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I 
Db 221 AAGGAAGT ACAT T T GAAGAAC GCAAGT AGAGGGAGT GCAGGAAACAAGAACT 170 

Qy 360 ACAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTT 419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 169 ACAGGAT GTAGGAAGAC CCT C CT GAGGAGT GAAGAGT GACAT GC CAC C GC AGGAT C CT T T 110 

Qy 420 GCT CT GC AC - AGT T ACCTG- TAAAC AT T G GAAT AC C GGCCAAAAAATAAGT T T GAT C ACA 477 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I M III 
Db 109 GCT CT GC AC GAGT T AC CT GT T AAACTT T GGAACAC CT ACCAAAAAAT AAGT T TGATAACA 50 

Qy 47 8 T TTCAAAGAT - GGC AT T TC C C CCAAT GAAAT AC ACAAGTAAACAT T C 523 

Ml I I I I I I III I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 4 9 T T TAAAAGATGGGC GTT TC C C C CAAT GAAAT ACACAAGTAAAC AT T C 3 



RESULT 15
AI604642
LOCUS       AI604642     882 bp    mRNA    linear   EST 21-APR-1999
DEFINITION  vm43d08.y1 Stratagene mouse diaphragm (#937303) Mus musculus cDNA
            clone IMAGE:1001007 5' similar to gb:M11568 INSULIN-LIKE GROWTH
            FACTOR IB PRECURSOR (HUMAN); gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AI604642
VERSION     AI604642.1  GI:4613809
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 882)
  AUTHORS   Marra,M., Hillier,L., Kucaba,T., Martin,J., Beck,C., Wylie,T.,
            Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers,Y.,
            Person,B., Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk,R.,
            Ritter,E., Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann,R.,
            Waterston,R. and Wilson,R.
  TITLE     The WashU-NCI Mouse EST Project 1999
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/WashU-NCI Mouse EST Project 1999
            Washington University School of Medicine
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL ; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:565223
            This read is a RESEQUENCE of a previously sequenced mouse clone
            This read has been verified (found to hit its original self in the
            correct orientation)
            Seq primer: -40RP from Gibco
            High quality sequence stop: 361.
FEATURES             Location/Qualifiers
     source          1..882
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:1001007"
                     /tissue_type="diaphragm"
                     /dev_stage="adult"
                     /lab_host="SOLR (kanamycin resistant)"
                     /clone_lib="Stratagene mouse diaphragm (#937303)"
                     /note="Organ: diaphragm; Vector: pBluescript SK-; Site_1:
                     EcoRI; Site_2: XhoI; Cloned unidirectionally from mRNA,
                     prepared from diaphragm muscle. Primer: Oligo dT. Average
                     insert size: 1.5 kb. Uni-ZAP XR Vector; ~5' adaptor
                     sequence: 5' GAATTCGGCACGAG 3' -3' adaptor sequence: 5'
                     CTCGAGTTTTTTTTTTTTTTTTTT 3'"
BASE COUNT      236 a    226 c    183 g    223 t     14 others
ORIGIN

Query Match 57.2%; Score 299; DB 9; Length 882; 

Best Local Similarity 77.2%; Pred. No. 3e-64; 

Matches 393; Conservative 0; Mismatches 109; Indels 7; Gaps 3; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 377 GGACCAGAGACCCTTTGCGGNGCTGAGCTGNTGGATGCTCTTCAGTTCGTGTGTGGACCG 436 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 437 AGGGGCTTTTACTTC7\ACAAGCCCACAGGCTATGGCTCCAGCATTTCGAGGGCACCTCAG 496 

Qy 121 ACAGGC AT CGT GGAT GAGT GCT GCT T C C GGAGCT GT GAT CT GAGGAGGCT GGAGAT GT AC 18 0 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I III I I I I I I II I I I I 
Db 4 97 ACAGGC AT TGT GGAT GAGT GT T GC T T C C GGAGCT GT GAT CT GANGAGACT GGAGAT GT AC 556 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

I I I I I II II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 557 T GT GC C C CACT GAAGCCTACANAAGCAGC CCG CT CTAT C C GT GCC C AGCGCCAC ACT GAC 616 

Qy 241 AT GC C CAAGACT C AGAAGT AT CAG C CT C CAT CT AC CAACAAGAAAAT GAAGT C T CAGAGG 300 

I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I III 
Db 617 AT GCNNCAGACT C AGAAGT CCC CGTN C CT AT C GACNNAAC AGAAAAC GAAGCT T GCAAGG 676 



Qy 



301 AGAAGGAAAGGAAGT ACAT T T GAAGAACACAAGT AGAGGGAGT GCAGGAAACAAGAACT A 360 
Ml I I I I I I I I I I I I I I I I I I I I I I II I II I MMM I I I I I I I I I III 



Db 



677 GAGAGGGAAGGGAGT AC ATT T GAGGAAC ACAN GTN GAGGAAGT GCAN GAAACAAGAC CT A 736 



Qy 361 CAGGATGTAGGAAGAC C CTT CT GAGGAGT GAAGAAGGAC AGGCCACC GCAGGAC C CTT T G 420 

I I I I I I I I I I I I I I I I I I II I I M I I 

Db 737 CCAGATGTANGAGGAGCCTTC — AAC C GAGCAGAAAT GC AC AT CACC C GNAGAT C CT T T G 794 

Qy 421 CT CT G C AC AGT T AC CT GTAAAC AT T GGAATAC C GGC CAA AAAAT AAGTT T GAT CAC 476 

II I I I I I I I I I I II I I I II I II I III I III I 

Db 795 CT - GCTTGAGCTAACT GCAAACAT CGAACCACTTACCAATTACAATAAT AAGTT CAATAA 853 

Qy 477 AT TT CAAAGAT GGCATT T CC CC CAAT GAA 505 

I I I I I I I I I I I II I I I I I I I I I I I I 
Db 854 CAT ACAAAGAT GGCATTT CC C C C ATT GAA 882 



Search completed: December 13, 2003, 07:29:51 
Job time : 1704.81 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:          December 13, 2003, 05:41:20 ; Search time 2336.77 Seconds
                 (without alignments)
                 9156.102 Million cell updates/sec

Title:           US-09-852-261-5
Perfect score:   523
Sequence:        1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        2888711 seqs, 20454813386 residues

Total number of hits satisfying chosen parameters:    5777422

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :       GenEmbl:
                 1:  gb_ba:*
                 2:  gb_htg:*
                 3:  gb_in:*
                 4:  gb_om:*
                 5:  gb_ov:*
                 6:  gb_pat:*
                 7:  gb_ph:*
                 8:  gb_pl:*
                 9:  gb_pr:*
                 10:  gb_ro:*
                 11:  gb_sts:*
                 12:  gb_sy:*
                 13:  gb_un:*
                 14:  gb_vi:*
                 15:  em_ba:*
                 16:  em_fun:*
                 17:  em_hum:*
                 18:  em_in:*
                 19:  em_mu:*
                 20:  em_om:*
                 21:  em_or:*
                 22:  em_ov:*
                 23:  em_pat:*
                 24:  em_ph:*
                 25:  em_pl:*
                 26:  em_ro:*
                 27:  em_sts:*
                 28:  em_un:*
                 29:  em_vi:*
                 30:  em_htg_hum:*
                 31:  em_htg_inv:*
                 32:  em_htg_other:*
                 33:  em_htg_mus:*
                 34:  em_htg_pln:*
                 35:  em_htg_rod:*
                 36:  em_htg_mam:*
                 37:  em_htg_vrt:*
                 38:  em_sy:*
                 39:  em_htgo_hum:*
                 40:  em_htgo_mus:*
                 41:  em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

                              SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID          Description

     1   523    100.0     523   6  AX147746    AX147746 Sequence
     2   523    100.0     523   6  AX300783    AX300783 Sequence
     3   467.4   89.4     517   6  AX147742    AX147742 Sequence
     4   467.4   89.4     517   6  AX300779    AX300779 Sequence
     5   409     78.2     471   6  AX147754    AX147754 Sequence
     6   409     78.2     471   6  AX300791    AX300791 Sequence
     7   364.8   69.8    1536  10  BC012409    BC012409 Mus muscu
     8   361.6   69.1     798  10  RNIGFI2     X06108 Rat mRNA (c
     9   361.6   69.1     958  10  RNIGFI1     X06107 Rat mRNA (c
    10   358.4   68.5     710  10  RATIGFIA    M15480 Rat insulin
    11   356.8   68.2     539   6  AX147744    AX147744 Sequence
    12   356.8   68.2     539   6  AX300781    AX300781 Sequence
    13   349.4   66.8     651  10  MMIGFIBR    X04482 Mouse mRNA
    14   336     64.2     432   4  AF022961    AF022961 Oryctolag
    15   334.4   63.9    7260   6  AX375028    AX375028 Sequence
    16   334.4   63.9    7260   6  AX411095    AX411095 Sequence
    17   334.4   63.9    7260   9  HSIGFACI    X57025 Human IGF-I
    18   332.8   63.6     666   6  A29119      A29119 H.sapiens I
    19   332.8   63.6     725   9  HSIGFI      X00173 Homo sapien
    20   332.8   63.6     728   9  HUMGFII     M29644 Human insul
    21   332.4   63.6     616   9  HSIGF1A     X56773 H.sapiens m
    22   331.2   63.3     620   6  108370      108370 Sequence 2
    23   331.2   63.3    1076   9  HUMIGFI     M27544 Human insul
    24   316.8   60.6     444   9  HSU40870    U40870 Human alter
    25   293.8   56.2     532   4  SSILGF1M    X17492 Porcine mRN
    26   293.8   56.2     567   4  PIGGFIIA    M31175 Pig insulin
    27   286.2   54.7     888   4  ECU85272    U85272 Equus cabal
    28   281.8   53.9     978   4  GOTIGFI     D11378 Goat mRNA f
    29   281.8   53.9     978   6  E05279      E05279 DNA encodin
    30   280.2   53.6    1284   4  BTILGF1A    X15726 Bovine mRNA
    31   275.4   52.7     747   4  SHPIGFIA6   M31735 Sheep insul
    32   275.4   52.7     790   4  SHPIGFIA21  M31734 Sheep insul
    33   275.4   52.7    1015   4  SHPIGFIA46  M31736 Sheep insul
    34   275.4   52.7    1107   4  SHPIGFIA1   M30653 Sheep insul
    35   275.2   52.6     730   9  HSIGF1B     X56774 H.sapiens m
    36   275.2   52.6    1094   9  HUMGFIB     M11568 Human insul
    37   275.2   52.6    1136   6  E01349      E01349 cDNA encodi
    38   275.2   52.6    1136   6  108009      108009 Sequence 3
    39   274.6   52.5    3599   6  BD063790    BD063790 Insulin-1
    40   274.6   52.5    3599   6  BD069041    BD069041 Treatment
    41   274.6   52.5    3600   6  BD063789    BD063789 Insulin-1
    42   274.6   52.5    3600   6  BD069040    BD069040 Treatment
    43   273.6   52.3     612   9  HUMIGFIB    M37484 Human insul
    44   271.6   51.9     696  10  MMIGFIAR    X04480 Mouse mRNA
    45   268.4   51.3     513  26  RNMGJIL     M17714 Rat insulin



ALIGNMENTS 



RESULT 1
AX147746
LOCUS       AX147746     523 bp    DNA     linear   PAT 31-AUG-2001
DEFINITION  Sequence 5 from Patent WO0136483.
ACCESSION   AX147746
VERSION     AX147746.1  GI:14346791
KEYWORDS    .
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.R. and Johnson,I.R.
  TITLE     Use of the insulin-like-growth factor i isoform mgf for the
            treatment of neurological disorders
  JOURNAL   Patent: WO 0136483-A 5 25-MAY-2001;
            University College London (GB)
FEATURES             Location/Qualifiers
     source          1..523
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..336
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAC41177.1"
                     /db_xref="GI:14346792"
                     /db_xref="REMTREMBL:CAC41177"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRR
                     KGSTFEEHK"
BASE COUNT      154 a    129 c    142 g     98 t
ORIGIN



Query Match 100.0%; Score 523; DB 6; Length 523; 
Best Local Similarity 100.0%; Pred. No. 9.5e-150; 
Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
              |||||||||||||||||||||||||||||||||||||||||||
Db        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523



RESULT 2
AX300783
LOCUS       AX300783     523 bp    DNA     linear   PAT 30-NOV-2001
DEFINITION  Sequence 5 from Patent WO0185781.
ACCESSION   AX300783
VERSION     AX300783.1  GI:17382064
KEYWORDS    .
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.D. and Terenghi,G.B.
  TITLE     Repair of nerve damage
  JOURNAL   Patent: WO 0185781-A 5 15-NOV-2001;
            University College London (GB) ; East Grinstead Medical Research
            Trust (GB)
FEATURES             Location/Qualifiers
     source          1..523
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..336
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD13042.1"
                     /db_xref="GI:17382065"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRR
                     KGSTFEEHK"
BASE COUNT      154 a    129 c    142 g     98 t
ORIGIN

Query Match 100.0%; Score 523; DB 6; Length 523; 

Best Local Similarity 100.0%; Pred. No. 9.5e-150; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
              |||||||||||||||||||||||||||||||||||||||||||
Db        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523



RESULT 3
AX147742
LOCUS       AX147742     517 bp    DNA     linear   PAT 31-AUG-2001
DEFINITION  Sequence 1 from Patent WO0136483.
ACCESSION   AX147742
VERSION     AX147742.1  GI:14346787
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Goldspink,G.R. and Johnson,I.R.
  TITLE     Use of the insulin-like-growth factor i isoform mgf for the
            treatment of neurological disorders
  JOURNAL   Patent: WO 0136483-A 1 25-MAY-2001;
            University College London (GB)
FEATURES             Location/Qualifiers
     source          1..517
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
     CDS             <1..333
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAC41175.1"
                     /db_xref="GI:14346788"
                     /db_xref="REMTREMBL:CAC41175"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKYQPPSTNKNTKSQRRK
                     GSTFEEHK"
BASE COUNT      150 a    130 c    139 g     98 t
ORIGIN



Query Match 89.4%; Score 467.4; DB 6; Length 517; 
Best Local Similarity 96.2%; Pred. No. 1.3e-132; 
Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 

Qy          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
              ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297

Qy        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357

Qy        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416

Qy        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476

Qy        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521
              |||||||||||||||||||||||||||||||||||||||||
Db        477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517



RESULT 4
AX300779
LOCUS       AX300779     517 bp    DNA     linear   PAT 30-NOV-2001
DEFINITION  Sequence 1 from Patent WO0185781.
ACCESSION   AX300779
VERSION     AX300779.1  GI:17382060
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Goldspink,G.D. and Terenghi,G.B.
  TITLE     Repair of nerve damage
  JOURNAL   Patent: WO 0185781-A 1 15-NOV-2001;
            University College London (GB) ; East Grinstead Medical Research
            Trust (GB)
FEATURES             Location/Qualifiers
     source          1..517
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
     CDS             <1..333
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD13040.1"
                     /db_xref="GI:17382061"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKYQPPSTNKNTKSQRRK
                     GSTFEEHK"
BASE COUNT      150 a    130 c    139 g     98 t
ORIGIN



Query Match 89.4%; Score 467.4; DB 6; Length 517; 
Best Local Similarity 96.2%; Pred. No. 1.3e-132; 
Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 

Qy          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
              ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297

Qy        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357

Qy        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416

Qy        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476

Qy        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521
              |||||||||||||||||||||||||||||||||||||||||
Db        477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517



RESULT 5
AX147754
LOCUS       AX147754     471 bp    DNA     linear   PAT 08-JUN-2001
DEFINITION  Sequence 13 from Patent WO0136483.
ACCESSION   AX147754
VERSION     AX147754.1  GI:14348552
KEYWORDS    .
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.R. and Johnson,I.R.
  TITLE     Use of the insulin-like-growth factor i isoform mgf for the
            treatment of neurological disorders
  JOURNAL   Patent: WO 0136483-A 13 25-MAY-2001;
            University College London (GB)
FEATURES             Location/Qualifiers
     source          1..471
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..318
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAC41264.1"
                     /db_xref="GI:14348553"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKEVHLKNTSRGSAGNKN
                     YRM"
BASE COUNT      132 a    118 c    131 g     90 t
ORIGIN

Query Match 78.2%; Score 409; DB 6; Length 471; 

Best Local Similarity 90.1%; Pred. No. 1.3e-114; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 

Qy          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
              |||||||||||||||
Db        241 ATGCCCAAGACTCAG--------------------------------------------- 255

Qy        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
                     |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        256 -------AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308

Qy        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        309 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368

Qy        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        369 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 428

Qy        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
              |||||||||||||||||||||||||||||||||||||||||||
Db        429 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471



RESULT 6
AX300791
LOCUS       AX300791     471 bp    DNA     linear   PAT 30-NOV-2001
DEFINITION  Sequence 13 from Patent WO0185781.
ACCESSION   AX300791
VERSION     AX300791.1  GI:17382072
KEYWORDS    .
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.D. and Terenghi,G.B.
  TITLE     Repair of nerve damage
  JOURNAL   Patent: WO 0185781-A 13 15-NOV-2001;
            University College London (GB) ; East Grinstead Medical Research
            Trust (GB)
FEATURES             Location/Qualifiers
     source          1..471
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..318
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD13045.1"
                     /db_xref="GI:17382073"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKEVHLKNTSRGSAGNKN
                     YRM"
BASE COUNT      132 a    118 c    131 g     90 t
ORIGIN

Query Match 78.2%; Score 409; DB 6; Length 471; 

Best Local Similarity 90.1%; Pred. No. 1.3e-114; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 

Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||
Db  241 ATGCCCAAGACTCAG--------------------------------------------- 255

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
               |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  256 -------AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  309 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  369 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 428

Qy  481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
        |||||||||||||||||||||||||||||||||||||||||||
Db  429 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471
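
The two percentages reported for RESULT 6 can be cross-checked from the counts printed on the same lines. The sketch below assumes that Query Match is the score divided by the perfect score (523, from the report header) and that Best Local Similarity is matches divided by aligned columns (matches + mismatches + indels). Neither formula is stated in the report; both are inferred from the printed figures, and the function names are illustrative only.

```python
def query_match(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect (self-match) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all aligned columns."""
    columns = matches + mismatches + indels
    return round(100.0 * matches / columns, 1)


# RESULT 6: Score 409; Matches 471; Mismatches 0; Indels 52
print(query_match(409, 523))              # report prints 78.2%
print(best_local_similarity(471, 0, 52))  # report prints 90.1%
```

The same two functions reproduce the percentages of the other hits in this listing when fed their counts.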



RESULT 7
BC012409
LOCUS       BC012409 1536 bp mRNA linear ROD 16-APR-2003
DEFINITION  Mus musculus insulin-like growth factor 1, mRNA (cDNA clone
            MGC:18617 IMAGE:4194295), complete cds.
ACCESSION   BC012409
VERSION     BC012409.1 GI:15214568
KEYWORDS    MGC.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 1536)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
  MEDLINE   22388257
  PUBMED    12477932
REFERENCE   2 (bases 1 to 1536)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Jeffrey E. Green, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 24 Row: k Column: 11
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 6754307.
FEATURES             Location/Qualifiers
     source          1..1536
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="FVB/N"
                     /db_xref="taxon:10090"
                     /clone="MGC:18617 IMAGE:4194295"
                     /tissue_type="Liver, normal. 5 month old male mouse."
                     /clone_lib="NCI_CGAP_Li9"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1536
                     /gene="Igf1"
                     /db_xref="LocusID:16000"
                     /db_xref="MGI:96432"
     CDS             28..429
                     /codon_start=1
                     /product="insulin-like growth factor 1"
                     /protein_id="AAH12409.1"
                     /db_xref="GI:15214569"
                     /db_xref="LocusID:16000"
                     /translation="MSSSHLFYLALCLLTFTSSTTAGPETLCGAELVDALQFVCGPRG
                     FYFNKPTGYGSSIRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPTKAARSIRAQRHTD
                     MPKTQKSPSLSTNKKTKLQRRRKGSTFEEHK"
BASE COUNT  485 a 324 c 303 g 424 t
ORIGIN



Query Match 69.8%; Score 364.8; DB 10; Length 1536;

Best Local Similarity 83.2%; Pred. No. 6.6e-101;

Matches 441; Conservative 0; Mismatches 82; Indels 7; Gaps 2;



Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||| ||||| || ||||| ||||||||||||||||||||||||||||||||||||
Db   94 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 153

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||| ||||||||||||||||| || |||||||||| ||||||||||||||||
Db  154 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 213

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        |||||||| ||||||||||| |||||||||||||||||||||||||| ||||||||||||
Db  214 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 273

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||| || || |||||  |||| |||||||||||  ||||||||||||||||||| |||
Db  274 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 333

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||||||  | | | | ||| || |||||||||| ||||   || |||
Db  334 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 393

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        ||||||||||||||||||||||||||||||||||||||| |||||||||||||||| |||
Db  394 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 453

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||| |||||||| || ||| |   ||||   | || | ||   |||||||||| ||||||
Db  454 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 513

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
        ||           |    |||||| | || |||  |||      || |||||||   ||
Db  514 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 573

Qy  475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
        ||||| ||||||| ||||||||||||||||||||| ||||||||||||||
Db  574 ACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAAGTAAACATTC 623
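
The score printed for RESULT 7 can be rebuilt from its summary counts. The sketch below takes Gapop 10.0 and Gapext 1.0 from the report header. The match score of 1 and the mismatch score of -0.6 are assumptions: the report names the IDENTITY_NUC table without printing its values, and these two numbers were chosen only because they reproduce the reported scores. Charging each gap one opening penalty plus one extension penalty per indel column is likewise an inference from the figures.

```python
# Gap penalties as printed in the report header.
GAP_OPEN = 10.0
GAP_EXTEND = 1.0
# Assumed substitution scores: not printed in the report, chosen because
# they reproduce the reported scores.
MATCH = 1.0
MISMATCH = -0.6


def rescore(matches: int, mismatches: int, indels: int, gaps: int) -> float:
    """Recompute an alignment score from the summary counts."""
    substitution = matches * MATCH + mismatches * MISMATCH
    gap_cost = gaps * GAP_OPEN + indels * GAP_EXTEND
    return round(substitution - gap_cost, 1)


# RESULT 7: Matches 441; Mismatches 82; Indels 7; Gaps 2 (report: Score 364.8)
print(rescore(441, 82, 7, 2))
```

If the search program used different table values, the constants above would need to change; the agreement with the printed scores is consistent with these values but does not prove them.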



RESULT 8
RNIGFI2
LOCUS       RNIGFI2 798 bp mRNA linear ROD 12-SEP-1993
DEFINITION  Rat mRNA (clone IGF1AB2) for insulin-like growth factor I.
ACCESSION   X06108 M32339 Y00429
VERSION     X06108.1 GI:56426
KEYWORDS    insulin-like growth factor I.
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1
  AUTHORS   Shimatsu,A. and Rotwein,P.
  TITLE     Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
            Within the 5' Untranslated Region
  JOURNAL   Nucleic Acids Res. 15 (1987) In press
REFERENCE   2 (bases 1 to 798)
  AUTHORS   Rotwein,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-OCT-1987) Rotwein P., Washington University, School
            of Medicine, 660 South Euclid Avenue, Box 8127, St. Louis, MO
            63110, USA
COMMENT     Another IGF-I mRNA of rat liver differing in the 5' UT-region is
            described in <X06107>.
FEATURES             Location/Qualifiers
     source          1..798
                     /organism="Rattus norvegicus"
                     /mol_type="mRNA"
                     /strain="Sprague-Dawley"
                     /db_xref="taxon:10116"
                     /tissue_type="liver"
     misc_feature    1..72
                     /note="5' UT-region"
     CDS             73..552
                     /note="IGF-I (AA 1-159)"
                     /codon_start=1
                     /protein_id="CAA29481.1"
                     /db_xref="GI:56427"
                     /translation="MGKISSLPTQLFKICLCDFLKIKIHIMSSSHLFYLALCLLTFTS
                     SATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSSIRRAPQTGIVDECCFRSCDL
                     RRLEMYCAPLKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK"
     misc_feature    553..798
                     /note="3' UT-region"
BASE COUNT  238 a 196 c 183 g 181 t
ORIGIN



Query Match 69.1%; Score 361.6; DB 10; Length 798; 

Best Local Similarity 82.8%; Pred. No. 5.8e-100; 

Matches 439; Conservative 0; Mismatches 84; Indels 7; Gaps 2; 



Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||| ||||| || ||||| |||||||||||||| |||||||||||||||||||||
Db  217 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 276

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||| ||||||||||||||||| || |||||||||| |||||||||||| |||
Db  277 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 336

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        || ||||| ||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  337 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 396

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||| || || |||||  ||||| |||| || ||| |||| |||||||||||||| |||
Db  397 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 456

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||||||  ||||| | ||| ||  ||||||||| ||||   || |||
Db  457 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 516

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        |||||||||||||||||| |||||||||||||||||||| |||||||||||||||| |||
Db  517 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 576

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||| |||||||| || ||| | |||||    | || | || | ||||||| || ||||||
Db  577 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 636

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
        ||           |    |||||| |||| ||| ||||      || ||| ||||  ||
Db  637 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 696

Qy  475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
         ||||||| |||| |||||||||| |||||||||||||||||||||||||
Db  697 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 746
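
A quick consistency check for a record such as RESULT 8 is that the four figures on its BASE COUNT line add up to the sequence length on the LOCUS line. The sketch below assumes the line lists only a, c, g and t counts, as every record in this listing does; the function name is hypothetical.

```python
import re


def base_count_total(line: str) -> int:
    """Sum the per-base counts on a GenBank BASE COUNT line."""
    counts = re.findall(r"(\d+)\s+([acgt])\b", line)
    return sum(int(n) for n, _base in counts)


line = "BASE COUNT 238 a 196 c 183 g 181 t"
# 798 is the length given on the LOCUS line of RESULT 8.
print(base_count_total(line) == 798)
```

A mismatch between the two numbers would point to a digit lost or altered in transcription.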



RESULT 9
RNIGFI1
LOCUS       RNIGFI1 958 bp mRNA linear ROD 12-SEP-1993
DEFINITION  Rat mRNA (clone IGF1AB1) for insulin-like growth factor I.
ACCESSION   X06107 M32260 Y00429
VERSION     X06107.1 GI:56424
KEYWORDS    insulin-like growth factor I.
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1
  AUTHORS   Shimatsu,A. and Rotwein,P.
  TITLE     Sequence of two rat insulin-like growth factor I mRNAs differing
            within the 5' untranslated region
  JOURNAL   Nucleic Acids Res. 15 (17), 7196 (1987)
  MEDLINE   88015572
  PUBMED    3658684
REFERENCE   2 (bases 1 to 958)
  AUTHORS   Rotwein,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-OCT-1987) Rotwein P., Washington University, School
            of Medicine, 660 South Euclid Avenue, Box 8127, St. Louis, MO
            63110, USA
COMMENT     Another IGF-I mRNA of rat liver differing in the 5' UT-region is
            described in <X06108>.
FEATURES             Location/Qualifiers
     source          1..958
                     /organism="Rattus norvegicus"
                     /mol_type="mRNA"
                     /strain="Srpague-Dawley"
                     /db_xref="taxon:10116"
                     /tissue_type="liver"
     misc_feature    1..258
                     /note="5' UT-region"
     CDS             259..738
                     /note="IGF-I (AA 1-159)"
                     /codon_start=1
                     /protein_id="CAA29480.1"
                     /db_xref="GI:56425"
                     /db_xref="SWISS-PROT:P08024"
                     /translation="MGKISSLPTQLFKICLCDFLKIKIHIMSSSHLFYLALCLLTFTS
                     SATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSSIRRAPQTGIVDECCFRSCDL
                     RRLEMYCAPLKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK"
     misc_feature    736..958
                     /note="3' UT-region"
BASE COUNT  260 a 246 c 209 g 243 t
ORIGIN



Query Match 69.1%; Score 361.6; DB 10; Length 958;

Best Local Similarity 82.8%; Pred. No. 6e-100;

Matches 439; Conservative 0; Mismatches 84; Indels 7; Gaps 2;



Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||| ||||| || ||||| |||||||||||||| |||||||||||||||||||||
Db  403 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 462

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||| ||||||||||||||||| || |||||||||| |||||||||||| |||
Db  463 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 522

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        || ||||| ||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  523 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 582

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||| || || |||||  ||||| |||| || ||| |||| |||||||||||||| |||
Db  583 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 642

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||||||  ||||| | ||| ||  ||||||||| ||||   || |||
Db  643 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 702

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        |||||||||||||||||| |||||||||||||||||||| |||||||||||||||| |||
Db  703 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 762

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||| |||||||| || ||| | |||||    | || | || | ||||||| || ||||||
Db  763 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 822

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
        ||           |    |||||| |||| ||| ||||      || ||| ||||  ||
Db  823 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 882

Qy  475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
         ||||||| |||| |||||||||| |||||||||||||||||||||||||
Db  883 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 932
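
The CDS span in a record like RESULT 9 can be checked against its annotated protein length. The sketch below is a minimal illustration that assumes a simple start..end location, /codon_start=1, and a span that includes the stop codon; it tolerates the stray space that the OCR inserted between the two dots. The function name is hypothetical.

```python
import re


def cds_residues(location: str) -> int:
    """Residues encoded by a simple start..end CDS span with a stop codon."""
    m = re.fullmatch(r"\s*<?(\d+)\s*\.\s*\.\s*>?(\d+)\s*", location)
    if m is None:
        raise ValueError(f"unsupported location: {location!r}")
    start, end = int(m.group(1)), int(m.group(2))
    length = end - start + 1
    if length % 3:
        raise ValueError("span is not a whole number of codons")
    return length // 3 - 1  # drop the terminal stop codon


# RESULT 9 annotates CDS 259..738 as "IGF-I (AA 1-159)".
print(cds_residues("259. .738"))
```

Joined or complemented locations are outside the scope of this sketch and would raise ValueError.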



RESULT 10
RATIGFIA
LOCUS       RATIGFIA 710 bp mRNA linear ROD 27-APR-1993
DEFINITION  Rat insulin-like growth factor I (IGF-I) mRNA, complete cds.
ACCESSION   M15480
VERSION     M15480.1 GI:204749
KEYWORDS    growth factor; insulin-like growth factor.
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1 (bases 1 to 710)
  AUTHORS   Roberts,C.T. Jr., Lasky,S.R., Lowe,W.L. Jr., Seaman,W.T. and
            LeRoith,D.
  TITLE     Molecular cloning of rat insulin-like growth factor I complementary
            deoxyribonucleic acids: differential messenger ribonucleic acid
            processing and regulation by growth hormone in extrahepatic tissues
  JOURNAL   Mol. Endocrinol. 1 (3), 243-248 (1987)
  MEDLINE   88288198
  PUBMED    3453891
COMMENT     Original source text: Rat (Sprague-Dawley) adult liver cDNA to
            mRNA, clone pRIGF-1-42.
            Draft entry and computer-readable copy of sequence in [Mol.
            Endocrinol. (1987) In press] kindly provided by S.R.Lasky,
            16-MAR-1987.
FEATURES             Location/Qualifiers
     source          1..710
                     /organism="Rattus norvegicus"
                     /mol_type="mRNA"
                     /db_xref="taxon:10116"
     CDS             106..507
                     /note="insulin-like growth factor precursor"
                     /codon_start=1
                     /protein_id="AAA41385.1"
                     /db_xref="GI:204750"
                     /translation="MSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQFVCGPRG
                     FYFNKPTGYGSSIRRAPQTGIVDECCFRSCDLRRLEMYCVRCKPTKSARSIRAQRHTD
                     MPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK"
     sig_peptide     106..171
                     /note="insulin-like growth factor signal peptide"
     mat_peptide     172..258
                     /product="insulin-like growth factor B peptide"
     mat_peptide     259..294
                     /product="insulin-like growth factor C peptide"
     mat_peptide     295..357
                     /product="insulin-like growth factor A peptide"
     mat_peptide     358..381
                     /product="insulin-like growth factor D peptide"
     mat_peptide     382..504
                     /product="insulin-like growth factor E peptide"
BASE COUNT  202 a 191 c 172 g 145 t
ORIGIN      1 bp upstream of EcoRI site.



Query Match 68.5%; Score 358.4; DB 10; Length 710;

Best Local Similarity 82.5%; Pred. No. 5.5e-99;

Matches 437; Conservative 0; Mismatches 86; Indels 7; Gaps 2;



Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||| ||||| || ||||| |||||||||||||| |||||||||||||||||||||
Db  172 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 231

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||| ||||||||||||||||| || |||||||||| |||||||||||| |||
Db  232 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 291

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        || ||||| ||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  292 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 351

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||  | |  ||||||  ||||| |||| || ||| |||| |||||||||||||| |||
Db  352 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 411

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||||||  ||||| | ||| ||  ||||||||| ||||   || |||
Db  412 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 471

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        |||||||||||||||||| |||||||||||||||||||| |||||||||||||||| |||
Db  472 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 531

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||| |||||||| || ||| | |||||    | || | || | ||||||| || ||||||
Db  532 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 591

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
        ||           |    |||||| |||| ||| ||||      || ||| ||||  ||
Db  592 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 651

Qy  475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
         ||||||| |||| |||||||||| |||||||||||||||||||||||||
Db  652 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 701
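
The Matches, Mismatches and Indels totals can be recounted directly from the Qy and Db rows, even when the rows carry stray spaces from scanning. The sketch below is illustrative: it assumes both rows cover the same alignment columns once spaces are removed and that "-" marks a gap; the function name is hypothetical. The example uses the first row of the RESULT 10 alignment, with the Db row left as scanned.

```python
def count_columns(qy: str, db: str) -> tuple[int, int, int]:
    """Return (matches, mismatches, indels) for one pair of alignment rows."""
    # Scanning scattered spaces through the rows; they carry no information.
    qy = qy.replace(" ", "")
    db = db.replace(" ", "")
    if len(qy) != len(db):
        raise ValueError("rows differ in length after removing spaces")
    matches = mismatches = indels = 0
    for q, d in zip(qy, db):
        if q == "-" or d == "-":
            indels += 1
        elif q == d:
            matches += 1
        else:
            mismatches += 1
    return matches, mismatches, indels


# First row of the RESULT 10 alignment (Qy 1-60 against Db 172-231).
qy_row = "GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC"
db_row = "GGAC CAGAGACC CTT T GC GGGGCT GAGCT GGT GGAC GCT CT T C AGT T CGT GT GTGGACCA"
print(count_columns(qy_row, db_row))
```

Summing the per-row tuples over all rows of a hit should give the totals on its summary lines.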



RESULT 11
AX147744
LOCUS       AX147744 539 bp DNA linear PAT 31-AUG-2001
DEFINITION  Sequence 3 from Patent WO0136483.
ACCESSION   AX147744
VERSION     AX147744.1 GI:14346789
KEYWORDS
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1
  AUTHORS   Goldspink,G.R. and Johnson,I.R.
  TITLE     Use of the insulin-like-growth factor i isoform mgf for the
            treatment of neurological disorders
  JOURNAL   Patent: WO 0136483-A 3 25-MAY-2001;
            University College London (GB)
FEATURES             Location/Qualifiers
     source          1..539
                     /organism="Rattus norvegicus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:10116"
     CDS             <1..336
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAC41176.1"
                     /db_xref="GI:14346790"
                     /db_xref="REMTREMBL:CAC41176"
                     /translation="GPETLCGAELVDALQFVCGPRGFYFNKPTVYGSSIRRAPQTGIV
                     DECCFRSCDLRRLEMYCVRCKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRR
                     KGSTLEEHK"
BASE COUNT  161 a 136 c 139 g 103 t
ORIGIN



Query Match 68.2%; Score 356.8; DB 6; Length 539;

Best Local Similarity 82.3%; Pred. No. 1.6e-98;

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2;



Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||| ||||| || ||||| |||||||||||||| |||||||||||||||||||||
Db    1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||| ||||||||||||||||  || |||||||||| |||||||||||| |||
Db   61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        || ||||| ||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||  | |  ||||||  ||||| |||| || ||| |||| |||||||||||||| |||
Db  181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||||||  ||||| | ||| ||  ||||||||| ||||   || |||
Db  241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        |||||||||||||||||| |||||||||||||||||||| |||||||||||||||| |||
Db  301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||| |||||||| || ||| | |||||    | || | || | ||||||| || ||||||
Db  361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
        ||           |    |||||| |||| ||| ||||      || ||| ||||  ||
Db  421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480

Qy  475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
         ||||||| |||| |||||||||| |||||||||||||||||||||||||
Db  481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530



RESULT 12
AX300781
LOCUS       AX300781 539 bp DNA linear PAT 30-NOV-2001
DEFINITION  Sequence 3 from Patent WO0185781.
ACCESSION   AX300781
VERSION     AX300781.1 GI:17382062
KEYWORDS
SOURCE      Rattus sp.
  ORGANISM  Rattus sp.
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1
  AUTHORS   Goldspink,G.D. and Terenghi,G.B.
  TITLE     Repair of nerve damage
  JOURNAL   Patent: WO 0185781-A 3 15-NOV-2001;
            University College London (GB); East Grinstead Medical Research
            Trust (GB)
FEATURES             Location/Qualifiers
     source          1..539
                     /organism="Rattus sp."
                     /mol_type="genomic DNA"
                     /db_xref="taxon:10118"
     CDS             <1..336
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD13041.1"
                     /db_xref="GI:17382063"
                     /translation="GPETLCGAELVDALQFVCGPRGFYFNKPTVYGSSIRRAPQTGIV
                     DECCFRSCDLRRLEMYCVRCKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRR
                     KGSTLEEHK"
BASE COUNT  161 a 136 c 139 g 103 t
ORIGIN



Query Match 68.2%; Score 356.8; DB 6; Length 539;

Best Local Similarity 82.3%; Pred. No. 1.6e-98;

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2;



Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||| ||||| || ||||| |||||||||||||| |||||||||||||||||||||
Db    1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||| ||||||||||||||||  || |||||||||| |||||||||||| |||
Db   61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        || ||||| ||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||  | |  ||||||  ||||| |||| || ||| |||| |||||||||||||| |||
Db  181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||||||  ||||| | ||| ||  ||||||||| ||||   || |||
Db  241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        |||||||||||||||||| |||||||||||||||||||| |||||||||||||||| |||
Db  301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||| |||||||| || ||| | |||||    | || | || | ||||||| || ||||||
Db  361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
        ||           |    |||||| |||| ||| ||||      || ||| ||||  ||
Db  421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480

Qy  475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
         ||||||| |||| |||||||||| |||||||||||||||||||||||||
Db  481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530



RESULT 13
MMIGFIBR
LOCUS       MMIGFIBR 651 bp mRNA linear ROD 21-MAR-1995
DEFINITION  Mouse mRNA for preproinsulin-like growth factor IB.
ACCESSION   X04482
VERSION     X04482.1 GI:51806
KEYWORDS    growth factor; insulin-like growth factor IB; preproinsulin-like
            growth factor IB; signal peptide.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 651)
  AUTHORS   Bell,G.I., Stempien,M.M., Fong,N.M. and Rall,L.B.
  TITLE     Sequences of liver cDNAs encoding two different mouse insulin-like
            growth factor I precursors
  JOURNAL   Nucleic Acids Res. 14 (20), 7873-7882 (1986)
  MEDLINE   87040760
  PUBMED    3774549
COMMENT     The sequence is identical to the preproIGF-IA sequence (X04480)
            except for the presence of a 52 bp insertion following codon 86
            (position 397 to 448), caused by alternative RNA splicing. The B
            domain of IGF comprises residues 1-29 (position 139-225), the C
            domain residues 30-41 (position 226-261), the A domain residues
            42-62 (position 262-324) and the D domain residues 63-70 (position
            325-348).
FEATURES             Location/Qualifiers
     source          1..651
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="Balb/c"
                     /db_xref="taxon:10090"
                     /clone="migf1-4"
                     /tissue_type="liver"
     misc_feature    43..45
                     /note="pot. translation start"
     CDS             73..474
                     /note="prepro IGF-IB (aa -22 to 111)"
                     /codon_start=1
                     /protein_id="CAA28170.1"
                     /db_xref="GI:51807"
                     /db_xref="MGI:96432"
                     /db_xref="SWISS-PROT:P05018"
                     /translation="MSSSHLFYLALCLLTFTSSTTAGPETLCGAELVDALQFVCGPRG
                     FYFNKPTGYGSSIRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPTKAARSIRAQRHTD
                     MPKTQKSPSLSTNKKTKLQRRRKGSTFEEHK"
     sig_peptide     73..138
                     /note="signal peptide (aa -22 to -1)"
     mat_peptide     139..348
                     /product="mature IGF-IB (aa 1-70)"
     misc_feature    349..471
                     /note="COOH-terminal peptide (E domain) (aa 71 to 111)"
     polyA_site      651
                     /note="polyA site"
BASE COUNT  193 a 185 c 149 g 124 t
ORIGIN



Query Match 66.8%; Score 349.4; DB 10; Length 651; 

Best Local Similarity 82.8%; Pred. No. 3.2e-96; 

Matches 425; Conservative 0; Mismatches 81; Indels 7; Gaps 2; 

Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||| ||||| || ||||| ||||||||||||||||||||||||||||||||||||
Db  139 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||| ||||||||||||||||| || |||||||||| ||||||||||||||||
Db  199 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        |||||||| ||||||||||| |||||||||||||||||||||||||| ||||||||||||
Db  259 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 318

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||| || || |||||  |||| |||||||||||  ||||||||||||||||||| |||
Db  319 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 378

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        |||||||||||||||||||  | | | | ||| || |||||||||| ||||   || |||
Db  379 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 438

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        ||||||||||||||||||||||||||||||||||||||| |||||||||||||||| |||
Db  439 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 498

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||| |||||||| || ||| |   ||||   | || | ||   |||||||||| ||||||
Db  499 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 558

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
        ||           |    |||||| | || |||  |||      || |||||||   ||
Db  559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 618

Qy  475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAA 506
        ||||| ||||||| |||||||||||||||||||
Db  619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651



RESULT 14
AF022961
LOCUS       AF022961     432 bp    mRNA    linear   MAM 14-OCT-1997
DEFINITION  Oryctolagus cuniculus insulin-like growth factor IB (IGF-IB) mRNA,
            complete cds.
ACCESSION   AF022961
VERSION     AF022961.1  GI:2522201
KEYWORDS
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1  (bases 1 to 432)
  AUTHORS   Flekna,G., Brem,G. and Mueller,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-SEP-1997) Institute of Animal Breeding and Genetics,
            Veterinary University of Vienna, Veterinaerplatz 1, Vienna A-1210,
            Austria
FEATURES             Location/Qualifiers
     source          1..432
                     /organism="Oryctolagus cuniculus"
                     /mol_type="mRNA"
                     /strain="ZIKA hybrid strain"
                     /db_xref="taxon:9986"
                     /cell_type="primary liver cells"
                     /tissue_type="liver"
     gene            1..432
                     /gene="IGF-IB"
     CDS             1..432
                     /gene="IGF-IB"
                     /codon_start=1
                     /product="insulin-like growth factor IB"
                     /protein_id="AAB80950.1"
                     /db_xref="GI:2522202"
                     /translation="MITPTVKMRILSSSHLFYLALCLLTFTSSATAGPETLCGAELVD
                     ALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPAKAA
                     RSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRRKGSTFEEHK"
BASE COUNT      105 a    126 c    115 g     86 t
ORIGIN

Query Match 64.2%; Score 336; DB 4; Length 432; 

Best Local Similarity 100.0%; Pred. No. 4.1e-92; 

Matches 336; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   97 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 156

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  157 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 216

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  217 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 276

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  277 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 336

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  337 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 396

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAG 336
        ||||||||||||||||||||||||||||||||||||
Db  397 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAG 432



RESULT 15
AX375028
LOCUS       AX375028    7260 bp    DNA     linear   PAT 01-MAR-2002
DEFINITION  Sequence 31 from Patent WO0210436.
ACCESSION   AX375028
VERSION     AX375028.1  GI:19169860
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Baak, J. and Mutter, G.L.
  TITLE     Prognostic classification of breast cancer
  JOURNAL   Patent: WO 0210436-A 31 07-FEB-2002;
            THE BRIGHAM AND WOMEN'S HOSPITAL, INC. (US)
            Baak, Jan (US)
FEATURES             Location/Qualifiers
     source          1..7260
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
BASE COUNT     2330 a   1415 c   1240 g   2275 t
ORIGIN

Query Match 63.9%; Score 334.4; DB 6; Length 7260;

Best Local Similarity 84.6%; Pred. No. 2e-91;

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4;

Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        ||||||||||| |||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
               ||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db  566 -------AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||||||||||||||||||| ||||||||||||||  |||| |||||||||||| ||||||
Db  619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678

Qy  421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478
        |||||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||
Db  679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738

Qy  479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
        || |||||| ||| ||||||||||||||||||||||||||||||||
Db  739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784
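
The "Best Local Similarity" figure printed with each hit appears to be the share of alignment columns that are exact matches. The report does not define it, so the Python sketch below treats that formula as an assumption and checks it against the counts printed for this result. The function names are hypothetical and are not part of GenCore.

```python
import re

def parse_counts(line: str) -> dict:
    """Turn 'Matches 445; Conservative 0; ...' into a dict of integers."""
    return {key: int(value) for key, value in re.findall(r"([A-Za-z]+)\s+(\d+);", line)}

def best_local_similarity(counts: dict) -> float:
    """Assumed definition: matches divided by all alignment columns, in percent."""
    columns = (counts["Matches"] + counts["Conservative"]
               + counts["Mismatches"] + counts["Indels"])
    return round(100.0 * counts["Matches"] / columns, 1)

# Statistics line as printed for RESULT 15 of this search.
result_15 = parse_counts("Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4;")
print(best_local_similarity(result_15))
```

Under this assumption the 445 matches over 526 columns reproduce the reported 84.6%. Whether conservative substitutions belong in the denominator cannot be checked here, because that count is 0 for every nucleotide hit shown.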



Search completed: December 13, 2003, 09:27:34 
Job time : 2336.77 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         December 13, 2003, 02:35:18 ; Search time 209.995 Seconds
                (without alignments)
                6723.048 Million cell updates/sec

Title:          US-09-852-261-5
Perfect score:  523
Sequence:       1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       2552756 seqs, 1349719017 residues

Total number of hits satisfying chosen parameters:      5105512

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      N_Geneseq_19Jun03:*
                1:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
                2:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
                3:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
                4:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
                5:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
                6:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
                7:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
                8:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
                9:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
                10:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
                11:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
                12:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
                13:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
                14:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
                15:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
                16:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
                17:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
                18:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
                19:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
                20:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
                21:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
                22:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
                23:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
                24:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*
                25:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2003.DAT:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



SUMMARIES

                     %
Result             Query
   No.    Score    Match   Length  DB  ID         Description

     1      523    100.0      523  22  AAD06400   Rabbit IGF-I isofo
     2      523    100.0      523  24  AAS16879   Rabbit mechano-gro
     3      523    100.0      553  18  AAT84893   Rabbit insulin lik
     4    467.4     89.4      517  22  AAD06398   Human IGF-I isofor
     5    467.4     89.4      517  24  AAS16877   Human mechano-grow
     6      409     78.2      471  22  AAD06405   Rabbit liver-type
     7      409     78.2      471  24  AAS16884   Rabbit insulin-lik
     8    356.8     68.2      539  22  AAD06399   Rat IGF-I isoform
     9    356.8     68.2      539  24  AAS16878   Rat mechano-growth
    10    349.4     66.8      651  25  ABV76185   Mouse insulin-like
    11    334.4     63.9      818   8  AAN70436   Sequence encoding
    12    334.4     63.9     7260  24  ABT11091   Human breast cance
    13    334.4     63.9     7260  24  ABK84583   Human cDNA differe
    14    334.4     63.9     7260  24  ABN97244   Gene #3742 used to
    15    334.4     63.9     7260  24  ABK64812   Human benign prost
    16    334.4     63.9     7260  24  ABK35504   Human endometrial
    17    334.4     63.9     7260  24  ABK35561   Gene IGF1 differen
    18    332.8     63.6      777  18  AAT84894   Human insulin like
    19    331.2     63.3      622   7  AAN60490   Human prepro-somat
    20    281.8     53.9      978  14  AAQ47804   Sequence encoding
    21    275.2     52.6     1136   8  AAN70435   Sequence encoding
    22    274.6     52.5     3599  19  AAV50428   Plasmid pIG0552 lo
    23    274.6     52.5     3599  19  AAV40796   Actual sequence of
    24    274.6     52.5     3600  19  AAV50427   Plasmid pIG0552 up
    25    274.6     52.5     3600  19  AAV40795   Expected sequence
    26    274.6     52.5     5707  20  AAX88055   Plasmid pIG0335 DN
    27    274.6     52.5     6345  20  AAX88054   Plasmid pIG0100A D
    28    273.6     52.3      612  22  AAS14695   Human cDNA encodin
    29    273.6     52.3      612  25  ABZ83309   Toxicologically re
    30    266.8     51.0     1052  20  AAX27498   Rat liver form of
    31      262     50.1      487  22  AAD06404   Rat liver-type IGF
    32      262     50.1      487  24  AAS16883   Rat insulin-like g
    33      250     47.8      671  24  ABT09479   Phase-1 Rat CT gen
    34    237.6     45.4      317  24  AAS16882   Human insulin-like
    35    237.6     45.4      318  22  AAD06403   Human liver-type I
    36    237.6     45.4      462  19  AAV50426   Human IGF-1 encodi
    37    237.6     45.4      462  19  AAV40794   Human IGF-I coding
    38    237.6     45.4      462  24  ABZ35734   Human IGF1 polynuc
    39    237.6     45.4      462  24  ABX09977   Human IGF1 DNA fra
    40    237.6     45.4      462  24  ABV78158   Human IGF1 DNA SEQ
    41    237.6     45.4      462  24  ABL91699   Human polynucleoti
    42      209     40.0      286  25  ABV76186   Mouse insulin-like
    43      193     36.9      210  24  AAD45568   Human insulin-like
    44      193     36.9      210  24  AAD44955   Human insulin grow
    45      193     36.9      210  24  ABA03146   Native mature IGF-
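
The "% Query Match" column of the summary table looks like the hit score expressed as a percentage of the perfect score (523 for this query). The report does not state the formula, so the Python sketch below treats it as an assumption and checks it against a handful of rows from the table. The function name is hypothetical.

```python
PERFECT_SCORE = 523  # "Perfect score" from the run header

def query_match_percent(score: float, perfect_score: float = PERFECT_SCORE) -> float:
    """Assumed definition: 100 * score / perfect score, rounded to one decimal."""
    return round(100.0 * score / perfect_score, 1)

# (score, reported % Query Match) pairs copied from the SUMMARIES table.
reported = [(523, 100.0), (467.4, 89.4), (409, 78.2),
            (356.8, 68.2), (334.4, 63.9), (193, 36.9)]

for score, percent in reported:
    status = "ok" if query_match_percent(score) == percent else "differs"
    print(f"{score:>6}  {percent:>5}  {status}")
```

Every sampled row agrees with the assumed formula, which also explains why hits of different lengths (for example 818 and 7260) share the same percentage whenever their scores are equal.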



ALIGNMENTS 



RESULT 1 
AAD06400 

ID AAD06400 standard; cDNA; 523 BP.
XX 

AC AAD06400; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rabbit IGF-I isoform mechano-growth factor (MGF) cDNA. 
XX 

KW Rabbit; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano-growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Oryctolagus cuniculus.



XX 

FH Key Location/Qualifiers

FT CDS 1..336
FT /*tag= a

FT /product= "Mechano-growth factor (MGF)"

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX

PN WO200136483-A1.
XX

PD 25-MAY-2001.
XX

PF 15-NOV-2000; 2000WO-GB04354.
XX

PR 15-NOV-1999; 99GB-0026968.
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02449. 
XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 53-54; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF),

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 



CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging,

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rabbit IGF-I isoform MGF cDNA. MGF is a muscle

CC isoform having extracellular (Ec) domain, hence also referred as

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame

CC of MGF. 

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 

Query Match 100.0%; Score 523; DB 22; Length 523; 

Best Local Similarity 100.0%; Pred. No. 5.1e-144; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy  481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
        |||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523



RESULT 2 



AAS16879 

ID AAS16879 standard; cDNA; 523 BP. 
XX 

AC AAS16879; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rabbit mechano-growth factor (MGF) cDNA. 
XX 

KW Rabbit; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 

KW nerve avulsion. 
XX 

OS Oryctolagus cuniculus. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..336 

FT /*tag= a 

FT /product= "Rabbit MGF" 

FT /partial 

FT /note= "No start codon"

FT exon 1..76

FT /*tag= b

FT /number= 3

FT exon 77..259

FT /*tag= c

FT /number= 4

FT exon 260..309

FT /*tag= d

FT /number= 5

FT exon 311..333

FT /*tag= e

FT /number= 6

XX 



PN WO200185781-A2. 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2001WO-GB02054.
XX

PR 10-MAY-2000; 2000GB-0011278.
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10561. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 7; 65pp; English. 



XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 20% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rabbit MGF. 

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 



Query Match 100.0%; Score 523; DB 24; Length 523; 

Best Local Similarity 100.0%; Pred. No. 5.1e-144; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy  481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
        |||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523



RESULT 3 
AAT84893 

ID AAT84893 standard; cDNA; 553 BP. 
XX 

AC AAT84893; 
XX 

DT 14-APR-1998 (first entry) 
XX 

DE Rabbit insulin like growth factor 1 encoding cDNA. 
XX 

KW Insulin like growth factor 1; IGF-1; Ec peptide; muscle disorder; 

KW heart; neuromuscular disease; primer; ss. 

XX 

OS Oryctolagus cuniculus.
XX 

FH Key Location/Qualifiers 

FT CDS 1..366 

FT /*tag= a 

FT /product= "IGF-1" 

XX 

PN WO9733997-A1.
XX

PD 18-SEP-1997.
XX

PF 11-MAR-1997; 97WO-GB00658.
XX

PR 11-MAR-1996; 96GB-0005124.
XX 

PA (UNLO ) ROYAL FREE HOSPITAL SCHOOL MED. 
XX 

PI Goldspink G; 
XX 

DR WPI; 1997-470877/43. 

DR P-PSDB; AAW23301. 
XX 

PT Use of insulin like growth factor I characterised by presence of Ec 

PT peptide - to treat humans or animals, particularly muscle disorders, 

PT heart conditions or neuromuscular diseases 
XX 

PS Disclosure; Fig 3; 33pp; English. 
XX 

CC A use of insulin like growth factor I (IGF-1) has been developed, and 

CC is characterised by the presence of the Ec peptide, or a functional 

CC equivalent, in the treatment or therapy of a human or animal. The IGF-1 

CC polypeptide can be used to treat muscular disorders, e.g. Duchenne or 

CC Becker muscular dystrophy, autosomal dystrophies and related progressive 

CC skeletal muscle weakness and wasting, muscle atrophy in ageing humans, 

CC spinal cord injury induced muscle atrophy and neuromuscular diseases, 

CC and cardiac disorders, e.g. diseases where promotion of cardiac muscle 

CC protein synthesis is a beneficial treatment, cardiomyopathies and acute 

CC heart failure or insult, specifically myocarditis or myocardial 

CC infarction. It can also be used to promote bone fracture healing and 

CC maintenance of bone in old age. The present sequence encodes rabbit 

CC IGF-1 used in the present specification. 

XX 

SQ Sequence 553 BP; 159 A; 142 C; 147 G; 105 T; 0 other; 



Query Match 100.0%; Score 523; DB 18; Length 553; 

Best Local Similarity 100.0%; Pred. No. 5.3e-144; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210

Qy  181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270

Qy  241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 330

Qy  301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  331 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 390

Qy  361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  391 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450

Qy  421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  451 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 510

Qy  481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
        |||||||||||||||||||||||||||||||||||||||||||
Db  511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553



RESULT 4 
AAD06398 



ID AAD06398 standard; cDNA; 517 BP. 
XX 

AC AAD06398; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human IGF-I isoform mechano-growth factor (MGF) cDNA. 
XX 

KW Human; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano-growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 



KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..333 

FT /*tag= a 

FT /product= "Mechano-growth factor (MGF)"

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 

PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354.
XX

PR 15-NOV-1999; 99GB-0026968.
XX

PA (UNLO ) UNIV COLLEGE LONDON.
XX

PI Goldspink G, Johnson I;
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02447. 
XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 49-50; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF),

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease.

CC The present sequence is human IGF-I isoform MGF cDNA. MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 

XX 

SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 



Query Match 89.4%; Score 467.4; DB 22; Length 517; 

Best Local Similarity 96.2%; Pred. No. 1.3e-127; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGG GGCT T T T ATT T CAACAAGC C CAC AG GAT AC GGCT C CAGCAGT C GGAGGGC ACCT CAG 12 0 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120 

Qy 121 AC AGGCAT C GT GGAT GAGT GCT GCT T C C GGAGC T GT GAT CT GAGGAGGCT GGAGAT GTAC 180 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 ACAGGCAT C GT GGAT GAGT GCTGCTTCC GGAG CT GTGAT CTAAGGAGGCT GGAGAT GT AT 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II I I I I I I I I I I I I I I II III I II I I I I I I I I I I I I I I I I I I I I I I II I I I I II 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 AT G C C CAAGACT CAGAAGT AT CAGC CT C CAT CTAC CAACAAGAAAAT GAAGT CT CAGAGG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I 
Db 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I II I I II I M I I I I I I I I I I I I I 
Db 298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 
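
The summary figures printed above each alignment can be reproduced from the match counts. The sketch below is illustrative only. It takes the perfect score (523) and the Gapop 10.0 / Gapext 1.0 penalties from the report header, assumes an identity match score of 1, and assumes a mismatch penalty of 0.6. The mismatch penalty is not stated anywhere in this report; it is simply the value that reproduces the listed scores. The function name `alignment_summary` is hypothetical.

```python
def alignment_summary(matches, mismatches, indels, gaps,
                      perfect_score=523.0,    # "Perfect score" from the report header
                      gap_open=10.0,          # "Gapop" from the report header
                      gap_ext=1.0,            # "Gapext" from the report header
                      mismatch_penalty=0.6):  # ASSUMPTION: inferred, not stated in the report
    """Recompute Score, Query Match and Best Local Similarity from the counts."""
    # Each match scores 1; each gap pays one opening penalty plus one
    # extension penalty per gapped position (indel).
    score = matches - mismatches * mismatch_penalty - gaps * gap_open - indels * gap_ext
    aligned_columns = matches + mismatches + indels
    return {
        "score": round(score, 1),
        "query_match_pct": round(100.0 * score / perfect_score, 1),
        "best_local_similarity_pct": round(100.0 * matches / aligned_columns, 1),
    }


# Counts taken from the alignment above (501 matches, 16 mismatches, 4 indels, 2 gaps).
summary = alignment_summary(matches=501, mismatches=16, indels=4, gaps=2)
print(summary)
```

Under these assumptions the same function also reproduces the figures of a hit with a single long gap (471 matches, 0 mismatches, 52 indels, 1 gap).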



RESULT 5 
AAS16877 

ID AAS16877 standard; cDNA; 517 BP. 
XX 

AC AAS16877; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human mechano-growth factor (MGF) cDNA. 
XX 



KW Human; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 

KW nerve avulsion. 
XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..333 

FT /*tag= a 

FT /product= "Human MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1..76 

FT /*tag= b 

FT /number= 3 

FT exon 77..259 

FT /*tag= c 

FT /number= 4 

FT exon 260..307 

FT /*tag= d 

FT /number= 5 

FT exon 308..330 

FT /*tag= e 

FT /number= 6 

XX 



PN WO200185781-A2. 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2001WO-GB02054. 
XX 

PR 10-MAY-2000; 2000GB-0011278. 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10559. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Claim 11; Fig 5; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 20% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the human MGF. 

XX 



SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 



Query Match 89.4%; Score 467.4; DB 24; Length 517; 

Best Local Similarity 96.2%; Pred. No. 1.3e-127; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I II I I I I I I I I I I I I I II II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I I I I I I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 
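
Each SQ line states the sequence length followed by its base composition, so the two can be cross-checked. The following is a minimal sketch of such a check, assuming the SQ line has already been cleaned of scanning errors; the helper name `parse_sq_line` is hypothetical and not part of any database tool.

```python
import re


def parse_sq_line(line):
    """Split an SQ line into its declared length and per-base counts."""
    header = re.match(r"SQ\s+Sequence\s+(\d+)\s+BP;(.*)", line)
    if header is None:
        raise ValueError("not an SQ line: %r" % line)
    declared_length = int(header.group(1))
    # Each composition field looks like "150 A;" or "0 other;".
    counts = {label: int(number)
              for number, label in re.findall(r"(\d+)\s+([A-Za-z]+);", header.group(2))}
    return declared_length, counts


# SQ line of the 517 BP human MGF cDNA record above.
length, counts = parse_sq_line("SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other;")
print(length, counts, sum(counts.values()) == length)
```

A mismatch between the declared length and the summed counts would point to a transcription error in the line rather than in the sequence itself.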



RESULT 6 
AAD06405 

ID AAD06405 standard; cDNA; 471 BP. 
XX 

AC AAD06405; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rabbit liver-type IGF-I isoform (L.IGF-I) cDNA. 
XX 

KW Rabbit; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano-growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 



KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; liver; L.IGF-I; ss. 

XX 

OS Oryctolagus cuniculus. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..318 

FT /*tag= a 

FT /product= "Liver-type IGF-I isoform (L.IGF-I)" 

FT /transl_except= (pos:7..9, aa:Gln) 

FT /transl_except= (pos:25..27, aa:Gln) 

FT /note= "These translation exceptions occur while decoding 

FT the alternative version of the protein (AAE02456). 

FT The CDS comprises exons 3, 4 and 6 and 

FT does not include start codon" 

FT /partial 

XX 



PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354. 
XX 

PR 15-NOV-1999; 99GB-0026968. 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02452, AAE02456. 

XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Disclosure; Page 59-60; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF), 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rabbit liver-type IGF-I isoform (L.IGF-I) cDNA. 

CC The L.IGF-I protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4 and 6. 



XX 

SQ Sequence 471 BP; 132 A; 118 C; 131 G; 90 T; 0 other; 



Query Match 78.2%; Score 409; DB 22; Length 471; 

Best Local Similarity 90.1%; Pred. No. 2e-110; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I 
Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

I I I I I I I I I I I I I I I 

Db 241 ATGCCCAAGACTCAG--------------------------------------------- 255 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 256 -------AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 309 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 369 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 428 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 429 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471 
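
The Matches, Mismatches, Indels and Gaps figures can be counted directly from a pair of aligned rows. The sketch below assumes the rows have been restored to equal length with `-` marking gapped positions, which is an assumption about the original layout because the scanned text has lost the gap characters. The function name `alignment_counts` is hypothetical, and a run of adjacent gap columns is treated as one gap regardless of which row it falls in.

```python
def alignment_counts(qy_row, db_row, gap_char="-"):
    """Count matches, mismatches, indels and gap runs in one aligned row pair."""
    if len(qy_row) != len(db_row):
        raise ValueError("aligned rows must have equal length")
    matches = mismatches = indels = gaps = 0
    inside_gap = False
    for q, d in zip(qy_row, db_row):
        if q == gap_char or d == gap_char:
            indels += 1
            if not inside_gap:      # first column of a new gap run
                gaps += 1
            inside_gap = True
        else:
            inside_gap = False
            if q == d:
                matches += 1
            else:
                mismatches += 1
    return matches, mismatches, indels, gaps


# Query positions 241-300 against database positions 241-255 of the hit above.
qy = "ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG"
db = "ATGCCCAAGACTCAG" + "-" * 45   # ASSUMPTION: gap padded with '-' to the row width
print(alignment_counts(qy, db))
```

Summing these tuples over every row of an alignment should give the totals printed in its summary lines, provided the gap run spanning two rows is counted once.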



RESULT 7 
AAS16884 

ID AAS16884 standard; cDNA; 471 BP. 
XX 

AC AAS16884; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rabbit insulin-like growth factor I liver-type isoform (L.IGF-I) cDNA. 
XX 

KW Rabbit; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 



KW nerve avulsion; insulin-like growth factor I liver-type isoform; L.IGF-I; 
XX 

OS Oryctolagus cuniculus. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..318 
FT /*tag= a 

FT /product= "Rabbit L.IGF-I" 

FT /partial 

FT /note= "No start codon" 

FT exon 1..75 

FT /*tag= b 

FT /number= exon 3 

FT exon 76..258 

FT /*tag= c 

FT /number= exon 4 

FT exon 259..315 

FT /*tag= d 

FT /number= exon 6 

XX 



PN WO200185781-A2. 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2001WO-GB02054. 
XX 

PR 10-MAY-2000; 2000GB-0011278. 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10564. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 10; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 20% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rabbit insulin-like 

CC growth factor I liver-type isoform (L.IGF-I) used in experiments on 

CC motoneuron loss. 

XX 

SQ Sequence 471 BP; 132 A; 118 C; 131 G; 90 T; 0 other; 



Query Match 78.2%; Score 409; DB 24; Length 471; 

Best Local Similarity 90.1%; Pred. No. 2e-110; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

I I I I I I I I I I I I I I I 

Db 241 ATGCCCAAGACTCAG--------------------------------------------- 255 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 256 -------AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I 
Db 309 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 369 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 428 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 429 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471 
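
The FT lines of a record give each exon as a `start..end` span, so exon lengths and contiguity can be checked mechanically. The sketch below is illustrative: it uses the exon spans listed for the rabbit L.IGF-I record above, written in cleaned-up form, and the names `parse_features`, `exon_lengths` and `exons_contiguous` are hypothetical helpers rather than part of any existing parser.

```python
import re

# Feature lines as listed for the rabbit L.IGF-I record, in cleaned-up form.
FT_LINES = [
    "FT CDS 1..318",
    "FT /*tag= a",
    "FT exon 1..75",
    "FT /number= exon 3",
    "FT exon 76..258",
    "FT /number= exon 4",
    "FT exon 259..315",
    "FT /number= exon 6",
]

# Matches only feature-key lines such as "FT exon 76..258"; qualifier lines
# start with "/" after the FT code and are skipped.
LOCATION = re.compile(r"^FT\s+(\w+)\s+(\d+)\.\.(\d+)\s*$")


def parse_features(lines):
    """Return (key, start, end) for every feature line with a simple span."""
    features = []
    for line in lines:
        found = LOCATION.match(line)
        if found:
            features.append((found.group(1), int(found.group(2)), int(found.group(3))))
    return features


def exon_lengths(features):
    """Length in bases of each exon, in listed order."""
    return [end - start + 1 for key, start, end in features if key == "exon"]


def exons_contiguous(features):
    """True when every exon starts one base after the previous exon ends."""
    exons = [(start, end) for key, start, end in features if key == "exon"]
    return all(nxt[0] == prev[1] + 1 for prev, nxt in zip(exons, exons[1:]))


features = parse_features(FT_LINES)
print(exon_lengths(features), exons_contiguous(features))
```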



RESULT 8 
AAD06399 

ID AAD06399 standard; cDNA; 539 BP. 
XX 

AC AAD06399; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rat IGF-I isoform mechano-growth factor (MGF) cDNA. 
XX 

KW Rat; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano-growth factor; neurological disorder; neurodegenerative disorder; 



KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Rattus sp. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..336 
FT /*tag= a 

FT /product= "Mechano-growth factor (MGF)" 

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 



PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354. 
XX 

PR 15-NOV-1999; 99GB-0026968. 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02448. 
XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 51-52; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF), 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rat IGF-I isoform MGF cDNA. MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 



XX 

SQ Sequence 539 BP; 161 A; 136 C; 139 G; 103 T; 0 other; 

Query Match 68.2%; Score 356.8; DB 22; Length 539; 

Best Local Similarity 82.3%; Pred. No. 5.1e-95; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I III 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

III I I I I I I I I II III I I I I I I I II I II I I I I I I I I II I I I I I I 

Db 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I 
Db 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530 



RESULT 9 
AAS16878 

ID AAS16878 standard; cDNA; 539 BP. 
XX 

AC AAS16878; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rat mechano-growth factor (MGF) cDNA. 
XX 

KW Rat; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 



KW nerve avulsion. 
XX 

OS Rattus sp. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..336 

FT /*tag= a 

FT /product= "Rat MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1..75 

FT /*tag= b 

FT /number= exon 3 

FT exon 76..258 

FT /*tag= c 

FT /number= exon 4 

FT exon 259..309 

FT /*tag= d 

FT /number= exon 5 

FT exon 310..333 

FT /*tag= e 

FT /number= exon 6 

XX 



PN WO200185781-A2. 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2001WO-GB02054. 
XX 

PR 10-MAY-2000; 2000GB-0011278. 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10560. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 6; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 



CC methods can reduce motoneuron loss by 20% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rat MGF. 

XX 

SQ Sequence 539 BP; 161 A; 136 C; 139 G; 103 T; 0 other; 



Query Match 68.2%; Score 356.8; DB 24; Length 539; 

Best Local Similarity 82.3%; Pred. No. 5.1e-95; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I II I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I III 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I J I I I I I I I III 
Db 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I II I I I I I I I I II III I I I I I I I II I II I I I I I I I I II I I I I I I 

Db 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474 

II I II I I II I II I I I I I I I I I I I I I I I I I II 

Db 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530 



RESULT 10 
ABV76185 

ID ABV76185 standard; cDNA; 651 BP. 
XX 

AC ABV76185; 
XX 

DT 07-MAR-2003 (first entry) 
XX 

DE Mouse insulin-like growth factor IB cDNA. 
XX 

KW Insulin-like growth factor IB; IGF-IB; mouse; mRNA; assay; 



KW nucleic acid detection; gene; ss. 
XX 

OS Mus musculus. 
XX 

FH Key Location/Qualifiers 

FT CDS 73..474 

FT /*tag= a 

FT /product= "IGF-IB" 



XX 

PN WO200297390-A2. 
XX 

PD 05-DEC-2002. 
XX 

PF 31-MAY-2002; 2002WO-SE01056. 
XX 

PR 01-JUN-2001; 2001SE-0001934. 
XX 

PA (BIOV-) BIOVITRUM AB. 
XX 

PI Parrow V, Rosengren L; 
XX 

DR WPI; 2003-129529/12. 
XX 

PT Quantitating a target nucleic acid in a sample comprises immobilizing 

PT on a solid support, a sample comprising a target nucleic acid, and 

PT detecting and quantitating signals generated from the antisense and 

PT sense probes - 
XX 

PS Example 1; Page 16-17; 18pp; English. 
XX 

CC The present sequence is that of cDNA encoding murine insulin-like 

CC growth factor IB (IGF-IB). The cDNA was used in an example of the 

CC method of the invention to generate probes for determination of 

CC IGF-IB RNA. The method comprises a quantitative hybridisation 

CC assay for analysis of mRNA in a target nucleic acid (TNA) sample. 

CC It involves: (i) immobilising the TNA sample on a solid support; 

CC (ii) contacting a labelled antisense probe to a first portion of the 

CC TNA, and a labelled sense probe to a second portion of the TNA; 

CC (iii) detecting and quantitating the signals generated from the 

CC hybridised probes; and (iv) determining the value represented by 

CC the antisense probe signal minus the sense probe signal, the value 

CC being proportional to the amount of mRNA in the TNA sample. In an 

CC example of the method, a cDNA clone containing 60 nucleotides from 

CC exon 2 and 179 nucleotides from exon 3 of the mouse IGF-IB gene was 

CC cloned into pGEM-4Z vector. Linearisation of the plasmid with 

CC EcoRI allowed transcription of a 250-nucleotide antisense probe 

CC using T7 polymerase. Linearisation with HindIII allowed 

CC transcription of a sense probe of similar length using SP6 

CC polymerase (see ABV76186). The probes were purified and used to 

CC determine IGF-I RNA in mouse hepatocytes and also in rat hepatocytes 

XX 

SQ Sequence 651 BP; 193 A; 185 C; 149 G; 124 T; 0 other; 



Query Match 66.8%; Score 349.4; DB 25; Length 651; 

Best Local Similarity 82.8%; Pred. No. 8.3e-93; 

Matches 425; Conservative 0; Mismatches 81; Indels 7; Gaps 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 139 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 199 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 259 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 318 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I III 
Db 319 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 378 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I I II III 
Db 379 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 438 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 439 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 498 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 499 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 558 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474 

II I I I I I I I I I I I I I III I I I I I I I I I II 

Db 559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 618 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAA 506 

I I I I I I I I I I I I I II I I I I I I I I I I I I II I I 
Db 619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651 
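
The comment block of this record describes the quantity used by the hybridisation assay: the antisense probe signal minus the sense probe signal, which is stated to be proportional to the amount of mRNA in the sample. A minimal sketch of that arithmetic follows. The function names and all signal values are hypothetical and chosen only for illustration; none of them come from the cited patent.

```python
def specific_signal(antisense_signal, sense_signal):
    """Antisense probe signal minus sense (background) probe signal."""
    return antisense_signal - sense_signal


def relative_mrna(sample, reference):
    """Ratio of specific signals; meaningful only if both were measured alike."""
    reference_value = specific_signal(*reference)
    if reference_value <= 0:
        raise ValueError("reference specific signal must be positive")
    return specific_signal(*sample) / reference_value


# HYPOTHETICAL readings as (antisense, sense) pairs in arbitrary units.
treated = (1250.0, 150.0)
control = (700.0, 150.0)
print(specific_signal(*treated), relative_mrna(treated, control))
```

Because the value is only proportional to mRNA amount, the ratio between two samples is easier to interpret than either difference alone.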



RESULT 11 
AAN70436 

ID AAN70436 standard; cDNA; 818 BP. 
XX 

AC AAN70436; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-APR-1991 (first entry) 

XX 

DE Sequence encoding insulin-like growth factor 1A (IGF-1A). 
XX 

KW Growth promoter; lactation enhancer; cell proliferation; ss. 
XX 

OS Homo sapiens . 
XX 

PN EP229750-A. 
XX 

PD 22-JUL-1987. 
XX 

PF 06-JAN-1987; 87EP-0870001.

PR 20-NOV-1986; 86US-0929671.

PR 07-JAN-1986; 86US-0816662.
XX 

PA (UNIW ) UNIV WASHINGTON. 
XX 

PI Krivi GG, Rotwein PS; 
XX 

DR WPI; 1987-200203/29. 
XX 



PT New pre-pro-insulin-like growth factor-1 protein - obtd. by 

PT recombinant DNA procedures for use as growth promoters for 

PT enhancing lactation, for stimulating cell proliferation etc. 
XX 

PS Example; Fig 5; 59pp; English. 
XX 

CC A 42 base oligonucleotide corresponding to the DNA sequence encoding 

CC amino acids 10 to 23 of mature human IGF-I was synthesized (AAN70437) . 

CC The radiolabeled 42 mer was then employed to screen for IGF-I 

CC containing DNA sequences in a human liver cDNA library. Insulin-

CC like growth factors-1A and -1B cDNAs were isolated from a human cDNA

CC library by using lambdagt 11 (AAN70435, AAN70436) . The human IGF-1 

CC genomic gene was isolated and mapped. It encodes at least two 

CC preproinsulin-like growth factor-1 proteins. An essentially pure 

CC proproinsulin-like growth factor-1 protein comprising the sequence 

CC of amino acids shown in Figure six is claimed (AAP70277) . 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 818 BP; 232 A; 186 C; 187 G; 213 T; 0 other; 



Query Match 63.9%; Score 334.4; DB 8; Length 818; 

Best Local Similarity 84.6%; Pred. No. 2.4e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 203 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 262 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 263 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 322 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 323 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 382 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I I I II I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 383 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 442 

Qy 241 AT GCCCAAGACTCAGAAGTAT CAGCCT C CAT CTACCAACAAGAAAATGAAGTCTCAGAGG 300 

I I I I I I I I I I I I I I 

Db 443 AT GCCCAAGACCCAG 457 

Qy 301 AGAAG GAAAGGAAGT AC AT T T GAAGAACACAAGT AGAGGGAGT GC AGGAAACAAGAACT A 360 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I 
Db 458 AAGGAAGT AC AT T T GAAGAAC GCAAGT AGAGGGAGT GC AGGAAACAAGAACT A 510 



Qy 361 CAG GAT GT AGGAAGAC C CT TCTGAGGAGT GAAGAAGGACAGG C C AC C GCAGGAC C CT TT G 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 511 CAGGATGTAGGAAGAC C CT CCT GAGGAGT GAAGAGT GACAT GC CACC G CAGGAT CCTT T G 570 

Qy 421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

Db 571 CT CT GCACGAGT T AC CT GT T AAACTTT GGAACAC C T AC CAAAAAAT AAGT TTGATAACAT 630 

Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II I I I I I I III I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 631 TTAAAAGAT GGGCGTTTCC C CCAAT GAAAT ACACAAGTAAAC AT T C 676 
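
The percentages in the summary lines for RESULT 11 can be reproduced from the reported counts. The sketch below is an inference from the numbers, not something the report states: it assumes Query Match is Score divided by the perfect score (523 for this query), and Best Local Similarity is Matches divided by the number of alignment columns (Matches + Mismatches + Indels). The function names are hypothetical.

```python
def query_match(score, perfect_score=523.0):
    """Percent of the perfect score achieved (assumed definition)."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches, mismatches, indels):
    """Percent of alignment columns that are identical (assumed definition)."""
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


# Values reported for RESULT 11 (AAN70436).
qm = round(query_match(334.4), 1)
bls = round(best_local_similarity(445, 26, 55), 1)
print(qm, bls)
```

Under these assumptions the same two ratios also agree with the figures reported for the other hits that share this alignment.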



RESULT 12 
ABT11091 

ID ABT11091 standard; cDNA; 7260 BP. 
XX 

AC ABT11091; 
XX 

DT 04-DEC-2002 (first entry) 
XX 

DE Human breast cancer associated coding sequence SEQ ID NO: 1225. 
XX 

KW Human; breast specific gene; breast cancer; differential expression; 

KW cytostatic; gene therapy; gene; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200259271-A2. 
XX 

PD 01-AUG-2002. 
XX 

PF 25-JAN-2002; 2002WO-US02176 . 
XX 

PR 25-JAN-2001; 2001US-263757P.

PR 25-APR-2001; 2001US-286090P.

PR 23-MAY-2001; 2001US-292517P.
XX 

PA (GENE- ) GENE LOGIC INC. 
XX 

PI Orr MS, Nation M, Diggans JC, Zeng W; 
XX 

DR WPI; 2002-674803/72. 
XX 

PT Diagnosing breast cancer in a patient comprises detecting the level of 

PT gene expression in cell or tissue samples, where a differential gene 

PT expression is indicative of breast cancer 
XX 

PS Claim 1; SEQ ID NO 1225; 260pp + Sequence Listing; English. 
XX 

CC The present invention relates to methods of diagnosing breast cancer in a 

CC patient, which comprise detecting the level of expression in a tissue 

CC sample of two or more genes selected from those shown in ABT09867- 

CC ABT11112, where a differential expression of the genes indicates breast 

CC cancer. The methods are useful in diagnosing, treating, detecting the 

CC progression, and in monitoring treatment of breast cancer in patients. 



CC The methods are also useful as a screening tool for agents that modulate 

CC the onset or progression of breast cancer. The breast cancer genes may be 

CC used as diagnostic markers for the prediction or identification of the 

CC malignant state of breast tissue, for confirming the type and progression 

CC of cancer, and for drug screening and assays. The present sequence is a 

CC coding sequence of the invention. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 
XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 

Query Match 63.9%; Score 334.4; DB 24; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 AC AGGCATC GT GGAT GAGT GCTGCTTCCG GAG CT GTGAT CT GAGGAGG CTGGAGAT GT AC 180 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 

Db 431 ACAGGCATCGTGGAT GAGT GCTGCTT CCGGAGCTGTGAT CTAAGGAGGCTGGAGAT GTAT 4 90 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I I I I I I I I I I I I I II III I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II 

Db 491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 AT GC C CAAGACT C AGAAGT AT CAGC CT C CAT CT AC CAACAAGAAAAT GAAGTCT CAGAGG 300 

I I I I I I I II I I III 

Db 551 AT G C C C AAGAC C CAG 565 

Qy 301 AGAAGGAAAGGAAGTAC AT T TGAAGAAC ACAAGT AGAGGGAGT GCAGGAAACAAGAACT A 360 

I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I II I I 
Db 566 AAGGAAGT AC AT T TGAAGAAC GCAAGT AGAGGGAGT GCAGGAAACAAGAACT A 618 

Qy 361 C AGGAT GT AG GAAGAC C C T T CTGAGGAGT GAAGAAGGAC AGGCCAC CGCAGGAC C CT T T G 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 619 CAGGAT GT AG GAAGAC C CT C CTGAGGAGT GAAGAGT GAC AT GC C AC CGCAGGAT C CT TT G 678 

Qy 421 CT CT GCAC-AGTTACCT G- TAAACATTGGAATACC GGCCAAAAAATAAGTTTGAT CACAT 47 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

Db 679 CT CT GCACGAGTT AC CT GT T AAAC T T T GGAACAC CT ACCAAAAAATAAGT T TGATAAC AT 738 

Qy 479 TT CAAAGAT- GGCATTT C C CCCAAT GAAATACACAAGTAAACATT C 523 

II I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 739 T TAAAAGAT GGGC GTT T C C C CCAAT GAAATACACAAGTAAACATT C 7 84 



RESULT 13 
ABK84583 

ID ABK84583 standard; cDNA; 7260 BP. 
XX 



AC ABK84583; 
XX 

DT 14-AUG-2002 (first entry) 
XX 

DE Human cDNA differentially expressed in granulocytic cells #1154. 
XX 

KW Human; ss; granulocytic cell; DNA chip; bacterial infection; 

KW viral infection; parasitic infection; protozoal infection; 

KW fungal infection; sterile inflammatory disease; psoriasis; 

KW rheumatoid arthritis; glomerulonephritis; asthma; thrombosis; 

KW cardiac reperfusion injury; renal reperfusion injury; ARDS; 

KW adult respiratory distress syndrome; inflammatory bowel disease; 

KW Crohn's disease; ulcerative colitis; periodontal disease; 

KW granulocyte activation; chronic inflammation; allergy. 

XX 

OS Homo sapiens. 
XX 

PN WO200228999-A2.
XX

PD 11-APR-2002.
XX

PF 03-OCT-2001; 2001WO-US30821.
XX 

PR 03-OCT-2000; 2000US-237189P . 
XX 

PA (GENE- ) GENE LOGIC INC. 
XX 

PI Beazer-Barclay Y, Weissman SM, Yamaga S, Vockley J; 
XX 

DR WPI; 2002-435328/46. 
XX 

PT Detecting granulocyte activation by detecting differential expression 

PT of genes associated with granulocyte activation, which serves as 

PT diagnostic markers that is useful for monitoring disease states and 

PT drug toxicity 
XX 

PS Claim 1; SEQ ID No 1154; 114pp; English. 
XX 

CC The invention relates to detecting (M1) granulocyte (GC) activation

CC (GCA), by detecting the level of expression of gene(s) (Gs) identified by

CC DNA chip analysis as given in the specification, and comparing 

CC the expression level to an expression level in an unactivated 

CC GC, where differential expression of Gs is indicative of GCA. 

CC Also included are modulating (M2) GA by contacting GC with an agent 

CC that alters the expression of at least one gene in Gs; (2) screening (M3) 

CC for an agent capable of modulating GCA or an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease using the 

CC gene expression profile; (3) detecting (M4) an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease, by detecting the 

CC level of expression in a sample of the tissue of gene(s) from Gs, where 

CC the level of expression of the gene is indicative of inflammation; 

CC (4) treating (M5) an inflammation (especially chronic) or in a tissue, 

CC an allergic response in a subject, exposure of a subject to a pathogen 

CC or sterile inflammatory disease, by contacting a tissue having 

CC inflammation with an agent that modulates the expression of gene(s) 



CC from Gs in the tissue. M1 is useful for detecting GCA; M2 is useful for 

CC modulating GA; M3 is useful for screening an agent capable of modulating 

CC GCA preferably in an inflammation in a tissue; M4 is useful for 

CC detecting an inflammation (especially chronic) in a tissue, an allergic 

CC response in a subject, exposure of a subject to a pathogen or sterile 

CC inflammatory disease (e.g. psoriasis, rheumatoid arthritis, 

CC glomerulonephritis, asthma, thrombosis, cardiac reperfusion injury, renal 

CC reperfusion injury, ARDS, adult respiratory distress syndrome, 

CC inflammatory bowel disease, Crohn's disease, ulcerative colitis, 

CC periodontal disease; also bacterial infection, viral infection, 

CC parasitic infection, protozoal infection, fungal infection and M5 is 

CC useful for treating one of the above conditions. The present 

CC sequence represents a gene differentially expressed in granulocytes. 

CC Note: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic 

CC format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences. 

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 



Query Match 63.9%; Score 334.4; DB 24; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGGCAT CGT GGAT GAGT GCT GCT T C C GGAGCT GT GAT CT GAGGAGGCT GGAGAT GT AC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 431 ACAGGCAT CGT GGATGAGT GCT GCT T C C GGAGCT GT GAT CTAAGGAGGCT GGAGAT GT AT 490 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I I I I I I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 AT GCCCAAGACTCAGAAGTATCAGCCTCCAT CTACCAACAAGAAAAT GAAGTCT CAGAGG 300 

I I I I I I I I I I I III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGT GAAGAAGGACAGGCCAC CGCAGGACCCTTT G 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGT GAAGAGT GACAT GC CAC CGCAGGAT CCTTT G 678 

Qy 421 CT CT GC AC- AGTTACCT G- TAAACATT GGAAT AC C GGC CAAAAAATAAGT T T GAT CAC AT 478 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I II I 
Db 679 CT CT GCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTT GATAACAT 738 



Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 739 TT AAAAGAT GGGCGT TT CCC C CAAT GAAAT ACACAAGTAAACAT T C 784 



RESULT 14 
ABN97244 

ID ABN97244 standard; DNA; 7260 BP. 
XX 

AC ABN97244; 
XX 

DT 13-AUG-2002 (first entry) 
XX 

DE Gene #3742 used to diagnose liver cancer. 
XX 

KW Gene; liver cancer; ds ; hepatocellular carcinoma; hepatotropic; 

KW metastatic liver tumour; cytostatic; expression profile; disease state; 

KW disease progression; drug toxicity; drug efficacy; drug metabolism. 

XX 

OS Homo sapiens. 
XX 

PN WO200229103-A2.
XX

PD 11-APR-2002.
XX 

PF 02-OCT-2001; 2001WO-US30589 . 
XX 

PR 02-OCT-2000; 2000US-237054P . 
XX 

PA (GENE-) GENE LOGIC INC. 
XX 

PI Home D, Alvares C, Peres-Da-Silva S, Vockley JG; 
XX 

DR WPI; 2002-426119/45. 
XX 

PT Diagnosing and detecting the progression of liver cancer, 

PT hepatocellular carcinoma or metastatic liver tumor in a patient, 

PT involves detecting the level of expression of two or more genes in a 

PT liver tissue sample 

XX 

PS Claim 1; SEQ ID NO 3742; 298pp; English. 
XX 

CC The invention relates to a novel method for diagnosing and detecting the 

CC progression of liver cancer, hepatocellular carcinoma or metastatic liver 

CC tumour in a patient, and differentiating metastatic liver cancer from 

CC hepatocellular carcinoma in a patient, involving detecting the level of 

CC expression of two or more genes represented in ABN93503-ABN97455 in a 

CC tissue sample. The method of the invention has hepatotropic, and 

CC cytostatic activity. The method is useful for diagnosing and detecting 

CC the progression of liver cancer, hepatocellular carcinoma and metastatic 

CC liver carcinoma in a patient. The method is useful for identifying 

CC expression profiles which serve as useful diagnostic markers as well as 

CC markers that can be used to monitor disease states, disease progression, 

CC drug toxicity, drug efficacy and drug metabolism. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 
XX 



SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 



Query Match 63.9%; Score 334.4; DB 24; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCT T T T AT TT CAACAAGC C CACAGGAT AC GGCT C CAGC AGT C GGAGGG CAC CT CAG 12 0 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGGC AT C GT GGAT GAGT GCT GCT T C C GGAGCT GT GAT CT GAGGAGGCTGGAGAT GTAC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I I I I I I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 AT GC C CAAGACT CAGAAGT ATCAGC CT C C AT CT AC CAACAAGAAAAT GAAGT CT CAGAGG 300 

I I I I I I I I I I I III 

Db 551 AT G C C C AAG AC C CAG 565 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II 
Db 566 AAGGAAGT ACAT TTGAAGAAC GCAAGT AGAGG GAGT GCAGGAAACAAGAACT A 618 

Qy 361 C AGGAT GT AGGAAGAC C CT T CT GAGGAGT GAAGAAGGAC AGGC CAC C GC AGGACCCTTT G 420 

I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 619 CAGGAT GTAGGAAGACCCTCCTGAGGAGT GAAGAGTGACATGC CAC CGC AGGAT CCTTT G 678 

Qy 421 CT CT GCAC - AGTT AC CT G - TAAACATT GGAAT AC C GGC CAAAAAATAAGT TT GATCACAT 478 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738 

Qy 479 TTCAAAGAT - GGCATTT CCCCCAATGAAATACACAAGTAAACATT C 523 

II I I I I I I III I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I 

Db 739 TT AAAAGAT GG GC GTTT C C CC CAAT GAAAT ACACAAGT AAACAT T C 7 84 



RESULT 15 
ABK64812 

ID ABK64812 standard; DNA; 7260 BP. 
XX 

AC ABK64812; 
XX 

DT 18-JUN-2002 (first entry) 
XX 

DE Human benign prostatic hyperplasia gene #707. 
XX 

KW Human; benign prostatic hyperplasia; BPH; prostate cancer; gene; ds . 
XX 

OS Homo sapiens . 
XX 



PN WO200212440-A2. 
XX 

PD 14-FEB-2002. 
XX 

PF 07-AUG-2001; 2001WO-US24708 . 
XX 

PR 07-AUG-2000; 2000US-223323P . 

PR 05-JUN-2001; 2001US-0873319 . 
XX 

PA (GENE-) GENE LOGIC INC. 

PA (NISB ) JAPAN TOBACCO INC. 
XX 

PI Munger WE, Kulkarni P, Getzenberg RH, Waga I, Yamamoto J; 
XX 

DR WPI; 2002-257476/30. 
XX 

PT Identifying drugs for and diagnosing benign prostatic hyperplasia, by 

PT detecting expression levels of one or more genes in prostate cells from 

PT patient that are differentially regulated compared to normal prostate 

PT cells 
XX 

PS Disclosure; Page 391-393; 444pp; English. 
XX 

CC The invention relates to a method of diagnosing (I) the onset or 

CC progression of benign prostatic hyperplasia (BPH) , or screening (II) for 

CC or identifying an agent that modulates the onset or progression of BPH. 

CC The method is based on changes in gene expression in BPH tissue isolated 

CC from patients exhibiting different clinical states of prostate 

CC hyperplasia as compared to normal prostate tissue. (I) comprises 

CC detecting the expression levels of one or more genes in prostate cells 

CC from the subject that are differentially regulated compared to normal 

CC prostate cells. (II) comprises preparing a first gene expression profile 

CC of BPH cells or BPH-like cell population, exposing the cells to the 

CC agent, preparing a second gene expression profile of the agent exposed 

CC cells, and comparing the first and second gene expression profiles. 

CC (I) is useful for diagnosing the onset or progression of BPH. (II) is 

CC useful for identifying an agent that modulates the onset or progression 

CC of BPH. The methods are useful to present information identifying 

CC the expression level in a tissue or cells, by comparing the expression 

CC level of genes given in the specification in the tissue or cells to the 

CC level of expression of gene in the database, and displaying the 

CC expression levels of at least one gene in the tissue or cell sample 

CC compared to the expression level in BPH. Agents using (II) are useful for 

CC treating BPH or prostate cancer. ABK64106-ABK64860 represent human 

CC benign prostatic hyperplasia gene sequences of the invention. 

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 

Query Match 63.9%; Score 334.4; DB 24; Length 7260; 
Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 431 ACAGG CAT C GT GGAT GAGT GC T GCT T C C GGAGCT GT GAT CTAAGGAGGCT GGAGAT GT AT 4 90 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I I II I II I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 AT GC C CAAGACT C AGAAGT AT CAGC CT CCATCT AC CAACAAGAAAAT GAAGT CT CAGAGG 300 

I I I I I I I I I I I II I 

Db 551 AT G C C C AAGAC C C AG 565 

Qy 301 AGAAGGAAAGGAAGTAC AT T T GAAGAACACAAGT AGAGGGAGT GCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 566 AAGGAAGT ACATT T GAAGAACGCAAGT AGAG GGAGT GCAGGAAACAAGAACTA 618 

Qy 361 CAGGAT GTAGGAAGACCCTTCT GAGGAGT GAAGAAGGACAGGCCACCGCAGGACCCTTT G 420 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II 
Db 619 CAGGAT GTAGGAAGACCCT CCT GAGGAGT GAAGAGT GACATGCCACCGCAGGATCCTTTG 678 

Qy 421 CT CT GCAC - AGT T ACCT G- T AAACAT T GGAATAC C GG C CAAAAAATAAGTT T GAT CAC AT 47 8 

I I I I I I I I I I I I I I I I I M II I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

Db 679 CT CT GCACGAGTTACCT GTTAAACTTT GGAACACCTAC CAAAAAATAAGTTTGATAACAT 738 

Qy 479 T T CAAAGAT - GGC AT TT C C CC CAAT GAAATACACAAGT AAACAT T C 523 

II I I I I I I III I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 739 T TAAAAGAT GGGCGTTTCCCC CAAT GAAATACACAAGT AAACATT C 784 
US-09-142-583A-3 

; Sequence 3, Application US/09142583A
; Patent No. 6221842
; STATE: VA
; FILING DATE: 11-MAR-1997
; APPLICATION NUMBER: GB 9605124.8
; FILING DATE: 11-MAR-1996
; ATTORNEY/AGENT INFORMATION:
; NAME: SADOFF, B. J.
; REGISTRATION NUMBER: 36663
; REFERENCE/DOCKET NUMBER: 117-263
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 7038164000
; TELEFAX: 7038164100
; INFORMATION FOR SEQ ID NO: 3:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 553 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: both
; TOPOLOGY: linear
; MOLECULE TYPE: cDNA
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 1..363
; SEQUENCE DESCRIPTION: SEQ ID NO: 3:
US-09-142-583A-3 

Query Match 100.0%; Score 523; DB 3; Length 553; 

Best Local Similarity 100.0%; Pred. No. 1.6e-154; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 330

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 331 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 390

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 391 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 451 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 510

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523

|||||||||||||||||||||||||||||||||||||||||||
Db 511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553



RESULT 2 

US-09-142-583A-5 

; Sequence 5, Application US/09142583A
; Patent No. 6221842
; GENERAL INFORMATION:
; APPLICANT: GOLDSPINK, GEOFFREY
; TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS
; NUMBER OF SEQUENCES: 11
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: NIXON & VANDERHYE P.C.
; STREET: 1100 NORTH GLEBE ROAD
; CITY: ARLINGTON
; STATE: VA
; COUNTRY: USA
; ZIP: 22201
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.25
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/142,583A
; FILING DATE: 29-Oct-1998
; CLASSIFICATION: <Unknown>
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: WO PCT/GB97/00658
; FILING DATE: 11-MAR-1997
; APPLICATION NUMBER: GB 9605124.8
; FILING DATE: 11-MAR-1996
; ATTORNEY/AGENT INFORMATION:
; NAME: SADOFF, B. J.
; REGISTRATION NUMBER: 36663
; REFERENCE/DOCKET NUMBER: 117-263
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 7038164000
; TELEFAX: 7038164100
; INFORMATION FOR SEQ ID NO: 5:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 553 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: both
; TOPOLOGY: linear
; MOLECULE TYPE: cDNA
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 341..397
; SEQUENCE DESCRIPTION: SEQ ID NO: 5:
US-09-142-583A-5 

Query Match 100.0%; Score 523; DB 3; Length 553; 

Best Local Similarity 100.0%; Pred. No. 1.6e-154; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 330

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 331 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 390

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 391 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 451 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 510

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523

|||||||||||||||||||||||||||||||||||||||||||
Db 511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553
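
The row of marker characters between each Qy and Db line flags the aligned positions where the two residues are identical. A minimal sketch of how such a row could be rebuilt from a Qy/Db pair is shown below. It assumes both rows are already padded to equal length, with `-` standing for a gap; the helper name `match_line` is hypothetical and is not part of the search program.

```python
def match_line(qy, db):
    """Return '|' where aligned residues are identical, ' ' elsewhere."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    return "".join(
        "|" if a == b and a != "-" else " "
        for a, b in zip(qy, db)
    )


# Final segment of the alignment above (Qy 481-523 against Db 511-553).
qy = "CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC"
db = "CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC"
markers = match_line(qy, db)
print(markers)
```

For a perfect hit such as this one the marker row is a solid run of `|`. A mismatch or a gap column leaves a blank at that position.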



RESULT 3 

US-09-142-583A-10 

; Sequence 10, Application US/09142583A 

; Patent No. 6221842 

; GENERAL INFORMATION: 

APPLICANT: GOLDSPINK, GEOFFREY 
; TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 

NUMBER OF SEQUENCES: 11 

CORRESPONDENCE ADDRESS: 



; ADDRESSEE : NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP : 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE : Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/142, 583A 

FILING DATE: 29-Oct-1998 
; CLASSIFICATION: <Unknown> 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: WO PCT/GB97/ 00658 

; FILING DATE: ll-MAR-1997 

; APPLICATION NUMBER: GB 9605124.8 

FILING DATE: ll-MAR-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: SADOFF, B. J. 
REGISTRATION NUMBER: 36663 
REFERENCE/ DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 7038164000 
TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 777 base pairs 

; TYPE: nucleic acid 

STRANDEDNESS: both 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
; FEATURE:

NAME/KEY: CDS
LOCATION: 26..493
SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
US-09-142-583A-10 



Query Match 63.6%; Score 332.8; DB 3; Length 777; 

Best Local Similarity 84.4%; Pred. No. 9.8e-95; 

Matches 444; Conservative 0; Mismatches 27; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 179 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 238



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 239 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 298

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 299 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 358



Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240



II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II I 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 359 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 418 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

I I I I I I I I I I I III 

Db 419 ATGCCCAAGACCCAG 433

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 434 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 486

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 487 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 546

Qy 421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I II I I I I I I I I II I I 

Db 547 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 606

Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523

II I I I I I I III I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I 

Db 607 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 652
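
The summary figures printed for each hit appear to follow simple arithmetic on the alignment counts. The sketch below is an illustration only, not GenCore's implementation. It assumes that a match (or a conservative T/U pair) scores +1, that a mismatch scores -0.6 (a value inferred from the reported numbers, not stated anywhere in this listing), and that each gap costs Gapop 10.0 plus Gapext 1.0 per gapped position. The function name `summarize` and its parameter names are hypothetical.

```python
def summarize(matches, conservative, mismatches, indels, gaps,
              perfect_score=523.0, mismatch_penalty=0.6,
              gap_open=10.0, gap_ext=1.0):
    """Recompute Score, Query Match and Best Local Similarity from counts.

    Assumptions (inferred from the listing, not documented in it):
    match/conservative = +1, mismatch = -mismatch_penalty,
    each gap = gap_open + gap_ext per gapped position.
    """
    gap_cost = gap_open * gaps + gap_ext * indels
    score = matches + conservative - mismatch_penalty * mismatches - gap_cost
    columns = matches + conservative + mismatches + indels  # alignment length
    return {
        "score": round(score, 1),
        "query_match_pct": round(100.0 * score / perfect_score, 1),
        "best_local_similarity_pct": round(100.0 * matches / columns, 1),
    }


if __name__ == "__main__":
    # Counts taken from the RESULT 3 summary lines.
    print(summarize(matches=444, conservative=0, mismatches=27,
                    indels=55, gaps=4))
```

With the counts listed for RESULT 3 (444 matches, 27 mismatches, 55 indels, 4 gaps) these assumptions give Score 332.8, Query Match 63.6% and Best Local Similarity 84.4%, which agrees with the values printed in the listing.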



RESULT 4 
5405942-2 

; Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
; JAMES P.

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS
;I AND II

NUMBER OF SEQUENCES: 16
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 2: 

LENGTH: 622 
5405942-2 

Query Match 63.3%; Score 331.2; DB 6; Length 622; 

Best Local Similarity 68.3%; Pred. No. 2.8e-94; 

Matches 359; Conservative 84; Mismatches 28; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I : I : I I II I I : I I I I : I I : I I I : I I : I : : I I I : : I I : I : I : I I I I I I 
Db 45 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 104 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I :::: I ::: I I I I I I I I I I I I I I I : I I I I : I I I I I I I : I I I I I II I I I : I I I 

Db 105 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCUCCAGCAGUCGGAGGGCGCCUCAG 164 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I : I I : I I I : I I I : I I : I I I I II I I I : I : I I : I I I I I I I I : I I I I I : I : I 
Db 165 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCAUAGGAGGCUGGAGAUGUAU 224



Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

: I I I I I I I I : I I I I I I II III I I I I 111:1 I : I I I : I I II I I I I I II II I I I I I 
Db 225 UGCGCACCCCUCAAGCCUGCCAAGUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 284 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

I : I I I I I I I I I III 

Db 285 AUGCCCAAGACCCAG 299

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

I I I I I I I : I I I ::: I I I I I I I I I I I : I I I I I I I I : I I I I I I I I I I I I I I I : I 
Db 300 AAGGAAGUACAUUUGAAGAACGCAAGUAGAGGGAGUGCAGGAAACAAGAACUA 352 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

I I I I I : I : I I I I I I I I I I : I : I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I : :: I 
Db 353 CAGGAUGUAGGAAGACCCUCCUGAGGAGUGAAGAGUGACAUGCCACCGCAGGAUCCUUUG 412

Qy 421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478

I : I : I I I I I I : : I I I : I : I I I I : : I I I I I I I I I I I I | | | : | | | : : : | | : I I I : 
Db 413 CUCUGCACGAGUUACCUGUUAAACUUUGGAACACCUACCAAAAAAUAAGUUUGAUAACAU 472 

Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523

: : Mill: III : : : I I I I I I I : I I I I : I I I I I I I : I I I I I : : I 
Db 473 UUAAAAGAUGGGCGUUUCCCCCAAUGAAAUACACAAGUAAACAUUC 518



RESULT 5 

US-08-472-809B-8 

; Sequence 8, Application US/08472809B 

; Patent No. 5925564 

; GENERAL INFORMATION: 

APPLICANT: Schwartz, Robert J. 
APPLICANT: DeMayo, Franco J. 
APPLICANT: O'Malley, Bert W. 
; TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS : 
ADDRESSEE: Lyon & Lyon 
; STREET: 633 West Fifth Street 

; STREET: Suite 4700 

CITY: Los Angeles 
STATE: California 
COUNTRY: U.S.A. 
ZIP: 90071-2066 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 
; MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 
OPERATING SYSTEM: IBM P.C. DOS 5.0 
SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/472,809B
; FILING DATE: June 7, 1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846 



FILING DATE: March 9, 1994 

APPLICATION NUMBER: 07/789,919 

FILING DATE: November 6, 1991
ATTORNEY/AGENT INFORMATION:
; NAME: Warburg, Richard J. 

REGISTRATION NUMBER: 32,327 

REFERENCE/DOCKET NUMBER: 214/212
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (213) 489-1600 
; TELEFAX: (213) 955-0440 

; TELEX: 67-3510 

; INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 5707 bases 

TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
; MOLECULE TYPE: cDNA 
US-08-472-809B-8 

Query Match 52.5%; Score 274.6; DB 2; Length 5707; 

Best Local Similarity 82.2%; Pred. No. 5.3e-76; 

Matches 351; Conservative 0; Mismatches 24; Indels 52; Gaps 1; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 793 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 852 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 853 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 912 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 913 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 972

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

II I I I I I I I I I I I I I I II III I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 973 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 1032 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

I I I I I I I I I I I III 

Db 1033 ATGCCCAAGACCCAG 1047

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1048 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 1100

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1101 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCCCCG 1160

Qy 421 CTCTGCA 427 

I I I I I 

Db 1161 GGCTGCA 1167 



RESULT 6 

US-08-472-809B-7 

; Sequence 7, Application US/08472809B 

; Patent No. 5925564 

; GENERAL INFORMATION: 

APPLICANT: Schwartz, Robert J. 
APPLICANT: DeMayo, Franco J. 
; APPLICANT: O'Malley, Bert W. 

TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Lyon & Lyon 

STREET: 633 West Fifth Street 
STREET: Suite 4700 
; CITY: Los Angeles 

; STATE: California 

COUNTRY: U.S.A. 
ZIP: 90071-2066 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 
; MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 
OPERATING SYSTEM: IBM P.C. DOS 5.0 
SOFTWARE: Word Perfect 5.1 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/472,809B
FILING DATE: June 7, 1995 
CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846 
FILING DATE: March 9, 1994 
APPLICATION NUMBER: 07/789,919 
FILING DATE: November 6, 1991
; ATTORNEY/AGENT INFORMATION: 
NAME: Warburg, Richard J. 
REGISTRATION NUMBER: 32,327 
REFERENCE/DOCKET NUMBER: 214/212
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (213) 489-1600 

TELEFAX: (213) 955-0440 
TELEX: 67-3510
; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 6345 bases 
; TYPE: nucleic acid 

STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-472-809B-7 

Query Match 52.5%; Score 274.6; DB 2; Length 6345; 

Best Local Similarity 82.2%; Pred. No. 5.6e-76; 

Matches 351; Conservative 0; Mismatches 24; Indels 52; Gaps 1;

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I | | | 



Db 3702 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 3761



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I MINI 
Db 3762 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 3821

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3822 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 3881

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

II I I I I I I I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 3882 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 3941

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

II I I I I I I I I I III 

Db 3942 ATGCCCAAGACCCAG 3956

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3957 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 4009

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 4010 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCCCCG 4069

Qy 421 CTCTGCA 427 

II I I I 

Db 4070 GGCTGCA 4076 



RESULT 7 

5405942-13 

;Patent No. 5405942 

; APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
; JAMES P. 

; TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 

;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO:13: 
; LENGTH: 357 

5405942-13 

Query Match 44.8%; Score 234.4; DB 6; Length 357; 

Best Local Similarity 93.8%; Pred. No. 5.1e-64; 

Matches 244; Conservative 0; Mismatches 16; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 102 



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 103 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 162 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I II I I I II I I I I II I I I I 

Db 163 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 222 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

II I I I I I I I I I I III II III I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 223 TGCGCACCCCTCAGGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 282 

Qy 241 ATGCCCAAGACTCAGAAGTA 260

I I I I I I I I I I I I I I I I I I 

Db 283 ATGCCCAAGACCCAGAAGGA 302



RESULT 8 
5405942-9 
Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
SEQ ID NO: 9: 

LENGTH: 357 
5405942-9 

Query Match 44.5%; Score 232.8; DB 6; Length 357; 

Best Local Similarity 76.5%; Pred. No. 1.6e-63; 

Matches 199; Conservative 44; Mismatches 17; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I II I I I I : I : I I I I I I : I I I I : I I : I I I I I : I : : I I I : : I I : I : I : I I I I I I 
Db 43 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGACGCUCUUCAGUUCGUGUGUGGAGAC 102

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I :::: I ::: I I I I I I I I I I I I I I I : I I I I : I I I I I I I : I I I I I I I I I I : I I I 
Db 103 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCUCCAGCAGUCGGAGGGCGCCUCAG 162 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I : I I : I I I : I I I : I I : I : : I I I I I I I : I : II : I : I I I I I I I : I I I I I : I : I 
Db 163 ACAGGUAUCGUGGAUGAGUGCUGUUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 222

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

: I 1111111:111111 II III I I I I I I I : I I : I I I : I I I I I I I I I I I M I I I I I 
Db 223 UGCGCACCCCUCAAGCCUGCCAAGUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 282 

Qy 241 ATGCCCAAGACTCAGAAGTA 260

I : I I I I I I I I I I I I II I I 



Db 283 AUGCCCAAGACCCAGAAGGA 302



RESULT 9 
5405942-7 

;Patent No. 5405942

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
; FILING DATE: 19-JUL-1984 

;SEQ ID NO: 7: 

LENGTH: 210 
5405942-7 

Query Match 36.6%; Score 191.4; DB 6; Length 210; 

Best Local Similarity 75.1%; Pred. No. 1.2e-50; 

Matches 157; Conservative 41; Mismatches 11; Indels 0; Gaps 0;

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I II I I I I I I I I I : I : I I I I I I : I I I I : I I : I | | : | | : | : : | | | :: | | : | : | : | | | | | | 
Db 1 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I :::: I ::: I I I I I I I II I I I I I I : I I I I : I I I I I I I : I I I I I I I I I I : II I 
Db 61 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCUCCAGCAGUCGGAGGGCGCCUCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I : I I : I I I : I I I : I I : I I : : I I I I I II : I : I I : I : I I I II I I : I I I I I : I : I 
Db 121 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 180

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 209 

: I I I I I I I I : I I I I I I II III I I I I 
Db 181 UGCGCACCCCUCAAGCCUGCCAAGUCAGC 209 



RESULT 10 

5405942-11 

;Patent No. 5405942 

; APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 630,557 

; FILING DATE: 19-JUL-1984 

;SEQ ID NO: 11: 



LENGTH: 210 
5405942-11 



Query Match 36.6%; Score 191.4; DB 6; Length 210; 

Best Local Similarity 94.7%; Pred. No. 1.2e-50; 

Matches 198; Conservative 0; Mismatches 11; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 209

II I I I I I I I I I I I I I I II III I I I I 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGC 209
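
The marker line printed between each Qy and Db row can be regenerated from the two sequence rows themselves, which is useful where the scanned markers are hard to read. The sketch below is illustrative: it assumes that `|` marks an identical base, that `:` marks a T/U pair (inferred from the hits against RNA entries, which report "Conservative" counts), and that a blank marks a mismatch or a gap. The function name `match_line` is hypothetical.

```python
def match_line(qy: str, db: str) -> str:
    """Rebuild the marker line for one aligned Qy/Db row pair.

    Assumed conventions: '|' identical base, ':' T/U pair,
    ' ' mismatch or gap ('-') in either row.
    """
    markers = []
    for q, d in zip(qy.upper(), db.upper()):
        if q == "-" or d == "-":
            markers.append(" ")          # gap column
        elif q == d:
            markers.append("|")          # identity
        elif {q, d} == {"T", "U"}:
            markers.append(":")          # DNA/RNA equivalent base
        else:
            markers.append(" ")          # mismatch
    return "".join(markers)


if __name__ == "__main__":
    # Last row pair of RESULT 10 (query positions 181-209).
    qy = "TGTGCACCCCTCAAGCCGGCAAAGGCAGC"
    db = "TGCGCACCCCTCAAGCCTGCCAAGTCAGC"
    print(qy)
    print(match_line(qy, db))
    print(db)
```

Counting the `|`, `:` and blank markers over all rows of one hit gives figures that can be compared with the Matches, Conservative and Mismatches values in its summary line.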



RESULT 11 
US-09-255-829-13 

; Sequence 13, Application US/09255829 
; Patent No. 6461617 
; GENERAL INFORMATION: 

APPLICANT: Shone, Clifford Charles 

APPLICANT: Quinn, Conrad Padraig 

APPLICANT: Foster, Keith Alan 

TITLE OF INVENTION: Recombinant Toxin Fragments 
; NUMBER OF SEQUENCES: 29 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: STERNE, KESSLER, GOLDSTEIN, & FOX P.L.L.C. 

STREET: 1100 NEW YORK AVENUE, NW, SUITE 600 

CITY: WASHINGTON 

STATE: DC 

COUNTRY: USA 
; ZIP: 20005-3934

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/255,829 

FILING DATE: 23-FEB-1999 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/GB97/02273
FILING DATE: 22-AUG-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/782,893 
; FILING DATE: 27-DEC-1996 

; ATTORNEY/AGENT INFORMATION: 
NAME: ESMOND, ROBERT W. 



REGISTRATION NUMBER: 32,893 
REFERENCE/DOCKET NUMBER: 1581.0130002
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-371-2600 

; TELEFAX: 202-371-2540 

INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2862 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double 
; TOPOLOGY: linear 

; MOLECULE TYPE: DNA (genomic) 
FEATURE:

NAME/KEY: CDS
LOCATION: 1..2862 
US-09-255-829-13 

Query Match 36.6%; Score 191.4; DB 4; Length 2862; 

Best Local Similarity 94.7%; Pred. No. 4.8e-50; 

Matches 198; Conservative 0; Mismatches 11; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 2644 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 2703 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2704 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 2763

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I 

Db 2764 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 2823

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 209 

II I I I I I I I I I I I I I I II III I I I I 

Db 2824 TGCGCACCCCTCAAGCCTGCCAAGTCAGC 2852



RESULT 12 

5405942-15 

;Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

; NUMBER OF SEQUENCES: 16 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 15: 
; LENGTH: 210 

5405942-15 



Query Match 36.3%; Score 189.8; DB 6; Length 210;



Best Local Similarity 74.6%; Pred. No. 3.8e-50; 

Matches 156; Conservative 41; Mismatches 12; Indels 0; Gaps 0; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I : I : I I I I I I : I I I I : I I : I I I : I I : I : : I I I : : I I : I : I : I I I II I 
Db 1 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I :::: I ::: I I I I I I I I I I I I I I I : I I I I : I M I I I I : I I I I II I I I I : I I I 
Db 61 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCUCCAGCAGUCGGAGGGCGCCUCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I : I I : I I I : I I I : II : I I : : I I I I I I I : I : I I : I : I I I I I I I : I I I I I : I : I 
Db 121 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 180

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 209

: I 1111111:11 III II III I I I I 
Db 181 UGCGCACCCCUCAGGCCUGCCAAGUCAGC 209



RESULT 13 
US-08-308-196A-1 

Sequence 1, Application US/08308196A 
Patent No. 5612198 
GENERAL INFORMATION: 

APPLICANT: Brierley, Russell A. 
APPLICANT: Davis, Geneva R. 
APPLICANT: Holtz, Gregory C. 
APPLICANT: Gleeson, Martin A. 
APPLICANT: Howard, Bradley D. 

TITLE OF INVENTION: Production of Insulin-Like Growth 
TITLE OF INVENTION: Factor-1 in Methylotrophic Yeast Cells 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Brown, Martin, Haller & McClain 
STREET: 1660 Union Street 
CITY: San Diego 
STATE: California 
COUNTRY: USA 
ZIP: 92101-2926 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/308,196A
FILING DATE: 09-SEPT-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/983,523 
FILING DATE: 03-MAR-1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/578,728 
FILING DATE: 04-SEP-1990 
ATTORNEY/AGENT INFORMATION:
NAME: Seidman, Stephanie L.
REGISTRATION NUMBER: 33,779
REFERENCE/DOCKET NUMBER: 51875
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (619)238-0999 
TELEFAX: (619)238-0062 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 240 base pairs 
TYPE: nucleic acid 
; STRANDEDNESS: double 

; TOPOLOGY: unknown 

MOLECULE TYPE: cDNA 
FEATURE:

NAME/KEY: CDS
LOCATION: 14..232
US-08-308-196A-1 

Query Match 35.4%; Score 185.2; DB 1; Length 240; 

Best Local Similarity 91.6%; Pred. No. 1.1e-48;

Matches 196; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I 
Db 77 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGACGGGCGCCTCAG 136 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 137 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTCGAGATGTAT 196

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCT 214 

I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 230 



RESULT 14 
PCT-US91-06452-1 

Sequence 1, Application PC/TUS9106452 
GENERAL INFORMATION: 
APPLICANT: Brierley, Russell A.
APPLICANT: Davis, Geneva R.
APPLICANT: Holtz, Gregory C.
APPLICANT: Gleeson, Martin A.
APPLICANT: Bradley, D. H.
TITLE OF INVENTION: Production of Insulin-Like Growth
TITLE OF INVENTION: Factor-1 in Methylotrophic Yeast Cells
NUMBER OF SEQUENCES: 12

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fitch, Even, Tabin & Flannery 

STREET: 135 South LaSalle Street, Suite 900 

CITY: Chicago 

STATE: Illinois 

COUNTRY: USA 

ZIP: 60603 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
; CURRENT APPLICATION DATA:

APPLICATION NUMBER: PCT/US91/06452
FILING DATE: 19910409 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 07/578,728 

; FILING DATE: 04-SEP-1990 

ATTORNEY/AGENT INFORMATION: 
; NAME: Seidman, Stephanie L. 

REGISTRATION NUMBER: 33,779 
REFERENCE/DOCKET NUMBER: 51874
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (619)552-1311 

TELEFAX: (619)552-0095 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 240 base pairs 
TYPE: NUCLEIC ACID 
STRANDEDNESS: double 
TOPOLOGY: unknown 
MOLECULE TYPE: cDNA 
FEATURE:

NAME/KEY: CDS
LOCATION: 14..232
PCT-US91-06452-1 

Query Match 35.4%; Score 185.2; DB 5; Length 240; 

Best Local Similarity 91.6%; Pred. No. 1.1e-48;

Matches 196; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I 
Db 77 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGACGGGCGCCTCAG 136 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 137 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTCGAGATGTAT 196

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCT 214 

I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 230 

RESULT 15 
US-09-029-267-13 

; Sequence 13, Application US/09029267 
; Patent No. 6107057 
; GENERAL INFORMATION: 

APPLICANT: Crawford, Kenneth 
; APPLICANT: Zaror, Isabel 



APPLICANT: Innis, Michael

TITLE OF INVENTION: Pichia Secretory Leader for Protein 
TITLE OF INVENTION: Expression 
NUMBER OF SEQUENCES: 40
CORRESPONDENCE ADDRESS:

ADDRESSEE: Chiron Corporation 
STREET: 4560 Horton Street 
CITY: Emeryville 
STATE: California 
COUNTRY: United States 
ZIP: 94608
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/029,267
FILING DATE: 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION:
NAME: Chung, Ling-Fong
REGISTRATION NUMBER: 36,482
REFERENCE/DOCKET NUMBER: 1165.100
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (510) 601-2704 
TELEFAX: (510) 655-3542 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS:
LENGTH: 390 base pairs
TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 
DESCRIPTION: /desc = "Synthetic" 
US-09-029-267-13 

Query Match 35.4%; Score 185.2; DB 3; Length 390; 

Best Local Similarity 91.6%; Pred. No. 1.5e-48; 

Matches 196; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 160 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 219 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I 
Db 220 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGACGGGCGCCTCAG 279

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 280 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTCGAGATGTAT 339 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCT 214 

I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 340 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 373

Search completed: December 13, 2003, 11:44:50 
Job time : 49.3585 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on: December 13, 2003, 07:29:55 ; Search time 233.512 Seconds
(without alignments)
7443.919 Million cell updates/sec

Title: US-09-852-261-5
Perfect score: 523
Sequence: 1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 2201672 seqs, 1661799599 residues

Total number of hits satisfying chosen parameters: 4403344

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Published_Applications_NA:*

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq2:*
14: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
15: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
16: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
18: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
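
The summary lines printed for each hit in the ALIGNMENTS section (Score, Query Match, Best Local Similarity) can be recomputed from the Matches / Mismatches / Indels / Gaps counts. The Python sketch below is illustrative only and is not part of the GenCore output. It assumes +1 per match under IDENTITY_NUC, the Gapop 10.0 / Gapext 1.0 values listed in the search header charged as Gapop per gap plus Gapext per indel position, and a mismatch penalty of 0.6. The 0.6 figure and the charging scheme are inferred from the numbers reported for the hits in this search, not taken from GenCore documentation, and the names Hit, sw_score, query_match and best_local_similarity are hypothetical. Pred. No. depends on the score distribution of the whole search and is not reproduced here.

```python
from dataclasses import dataclass

# Values taken from the search header of this report.
PERFECT_SCORE = 523   # "Perfect score: 523"
GAP_OPEN = 10.0       # "Gapop 10.0"
GAP_EXTEND = 1.0      # "Gapext 1.0"

# Assumption: inferred from the reported scores, not stated in the report.
MISMATCH_PENALTY = 0.6


@dataclass
class Hit:
    """Counts from the 'Matches ...; Mismatches ...; Indels ...; Gaps ...' line."""
    matches: int
    mismatches: int
    indels: int   # total number of gapped positions
    gaps: int     # number of separate gap runs


def sw_score(hit: Hit) -> float:
    """Score under the assumed scheme: +1 per match, affine gap penalty."""
    return (hit.matches
            - MISMATCH_PENALTY * hit.mismatches
            - GAP_OPEN * hit.gaps
            - GAP_EXTEND * hit.indels)


def query_match(hit: Hit) -> float:
    """Score as a percentage of the perfect (self-match) score."""
    return 100.0 * sw_score(hit) / PERFECT_SCORE


def best_local_similarity(hit: Hit) -> float:
    """Matches as a percentage of all aligned columns."""
    columns = hit.matches + hit.mismatches + hit.indels
    return 100.0 * hit.matches / columns


if __name__ == "__main__":
    result_2 = Hit(matches=501, mismatches=16, indels=4, gaps=2)
    print(round(sw_score(result_2), 1),
          round(query_match(result_2), 1),
          round(best_local_similarity(result_2), 1))
```

Under these assumptions, Hit(501, 16, 4, 2) gives a score of 467.4, a Query Match of 89.4% and a Best Local Similarity of 96.2%, which agrees with the values printed for RESULT 2.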
ALIGNMENTS



RESULT 1 
US-09-852-261-5 

; Sequence 5, Application US/09852261
; Patent No. US20020083477A1
; GENERAL INFORMATION:
; APPLICANT: GOLDSPINK, GEOFFREY
; APPLICANT: TERENGHI, GIORGIO
; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE
; FILE REFERENCE: 117-351
; CURRENT APPLICATION NUMBER: US/09/852,261
; CURRENT FILING DATE: 2001-05-10
; PRIOR APPLICATION NUMBER: GB 0011278.9
; PRIOR FILING DATE: 2000-05-10
; NUMBER OF SEQ ID NOS: 14
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 5
; LENGTH: 523
; TYPE: DNA
; ORGANISM: Oryctolagus cuniculus
US-09-852-261-5

Query Match 100.0%; Score 523; DB 9; Length 523; 

Best Local Similarity 100.0%; Pred. No. 1.9e-161; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy    421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy    481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
          |||||||||||||||||||||||||||||||||||||||||||
Db    481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523



RESULT 2 



US-09-852-261-1 

; Sequence 1, Application US/09852261
; Patent No. US20020083477A1
; GENERAL INFORMATION:
; APPLICANT: GOLDSPINK, GEOFFREY
; APPLICANT: TERENGHI, GIORGIO
; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE
; FILE REFERENCE: 117-351
; CURRENT APPLICATION NUMBER: US/09/852,261
; CURRENT FILING DATE: 2001-05-10
; PRIOR APPLICATION NUMBER: GB 0011278.9
; PRIOR FILING DATE: 2000-05-10
; NUMBER OF SEQ ID NOS: 14
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 1
; LENGTH: 517
; TYPE: DNA
; ORGANISM: Homo sapiens
US-09-852-261-1 

Query Match 89.4%; Score 467.4; DB 9; Length 517; 

Best Local Similarity 96.2%; Pred. No. 3.8e-143; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db      1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db     61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db    181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          ||||||||||| |||||||||||||| ||||||||||||||||| | ||||||||||
Db    241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAG--- 297

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db    358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416

Qy    421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476

Qy    481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521
          |||||||||||||||||||||||||||||||||||||||||
Db    477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517



RESULT 3 

US-09-852-261-13 

; Sequence 13, Application US/09852261
; Patent No. US20020083477A1
; GENERAL INFORMATION:
; APPLICANT: GOLDSPINK, GEOFFREY
; APPLICANT: TERENGHI, GIORGIO
; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE
; FILE REFERENCE: 117-351
; CURRENT APPLICATION NUMBER: US/09/852,261
; CURRENT FILING DATE: 2001-05-10
; PRIOR APPLICATION NUMBER: GB 0011278.9
; PRIOR FILING DATE: 2000-05-10
; NUMBER OF SEQ ID NOS: 14
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 13
; LENGTH: 471
; TYPE: DNA
; ORGANISM: Oryctolagus cuniculus
US-09-852-261-13 

Query Match 78.2%; Score 409; DB 9; Length 471; 

Best Local Similarity 90.1%; Pred. No. 6.1e-124; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          |||||||||||||||
Db    241 ATGCCCAAGACTCAG--------------------------------------------- 255

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
                 |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    256 -------AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    309 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368

Qy    421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    369 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 428

Qy    481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
          |||||||||||||||||||||||||||||||||||||||||||
Db    429 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471



RESULT 4 
US-09-852-261-3 

; Sequence 3, Application US/09852261
; Patent No. US20020083477A1
; GENERAL INFORMATION:
; APPLICANT: GOLDSPINK, GEOFFREY
; APPLICANT: TERENGHI, GIORGIO
; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE
; FILE REFERENCE: 117-351
; CURRENT APPLICATION NUMBER: US/09/852,261
; CURRENT FILING DATE: 2001-05-10
; PRIOR APPLICATION NUMBER: GB 0011278.9
; PRIOR FILING DATE: 2000-05-10
; NUMBER OF SEQ ID NOS: 14
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 3
; LENGTH: 539
; TYPE: DNA
; ORGANISM: Rattus sp.
US-09-852-261-3 

Query Match 68.2%; Score 356.8; DB 9; Length 539; 

Best Local Similarity 82.3%; Pred. No. 9.9e-107; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          ||||| ||||| || ||||| |||||||||||||| |||||||||||||||||||||
Db      1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||| ||||||||||||||||  || |||||||||| |||||||||||| |||
Db     61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          || ||||| ||||||||||| |||||||||||||||||||||||||||||||||||||||
Db    121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          ||||  | |  ||||||  ||||| |||| || ||| |||| |||||||||||||| |||
Db    181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          |||||||||||||||||||  ||||| | ||| ||  ||||||||| ||||   || |||
Db    241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
          |||||||||||||||||| |||||||||||||||||||| |||||||||||||||| |||
Db    301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||| |||||||| || ||| | |||||    | || | || | ||||||| || ||||||
Db    361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420

Qy    421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
          ||           |    |||||| |||| ||| ||||      || ||| ||||  ||
Db    421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480

Qy    475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
           ||||||| |||| |||||||||| |||||||||||||||||||||||||
Db    481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530



RESULT 5 
US-10-161-088-1 

; Sequence 1, Application US/10161088
; Publication No. US20030077761A1
; GENERAL INFORMATION:
; APPLICANT: Parrow, Vendela
; APPLICANT: Rosengren, Linda
; TITLE OF INVENTION: NEW METHODS
; FILE REFERENCE: 13425-111001
; CURRENT APPLICATION NUMBER: US/10/161,088
; CURRENT FILING DATE: 2002-05-31
; PRIOR APPLICATION NUMBER: SE 0101934-8
; PRIOR FILING DATE: 2001-06-01
; NUMBER OF SEQ ID NOS: 3
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 1
; LENGTH: 651
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (73)...(471)
US-10-161-088-1 



Query Match 66.8%; Score 349.4; DB 15; Length 651;

Best Local Similarity 82.8%; Pred. No. 3e-104;

Matches 425; Conservative 0; Mismatches 81; Indels 7; Gaps 2;



Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          ||||| ||||| || ||||| ||||||||||||||||||||||||||||||||||||
Db    139 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||| ||||||||||||||||| || |||||||||| ||||||||||||||||
Db    199 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          |||||||| ||||||||||| |||||||||||||||||||||||||| ||||||||||||
Db    259 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 318

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          ||||| || || |||||  |||| |||||||||||  ||||||||||||||||||| |||
Db    319 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 378

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          |||||||||||||||||||  | | | | ||| || |||||||||| ||||   || |||
Db    379 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 438

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
          ||||||||||||||||||||||||||||||||||||||| |||||||||||||||| |||
Db    439 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 498

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||| |||||||| || ||| |   ||||   | || | ||   |||||||||| ||||||
Db    499 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 558

Qy    421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 474
          ||           |    |||||| | || |||  |||      || |||||||   ||
Db    559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 618

Qy    475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAA 506
          ||||| ||||||| |||||||||||||||||||
Db    619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651



RESULT 6 

US-09-919-497-24 

; Sequence 24, Application US/09919497
; Patent No. US20020106662A1
; GENERAL INFORMATION:
; APPLICANT: Mutter, George L.
; TITLE OF INVENTION: PROGNOSTIC CLASSIFICATION OF ENDOMETRIAL CANCER
; FILE REFERENCE: B0801/7225
; CURRENT APPLICATION NUMBER: US/09/919,497
; CURRENT FILING DATE: 2001-07-31
; PRIOR APPLICATION NUMBER: US 60/221,735
; PRIOR FILING DATE: 2000-07-31
; NUMBER OF SEQ ID NOS: 100
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 24
; LENGTH: 7260
; TYPE: DNA
; ORGANISM: Homo sapiens
US-09-919-497-24 

Query Match 63.9%; Score 334.4; DB 10; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db    311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db    371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db    431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db    491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          ||||||||||| |||
Db    551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
                 ||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db    566 -------AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||||||||||||| ||||||||||||||  |||| |||||||||||| ||||||
Db    619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678

Qy    421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478
          |||||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||
Db    679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738

Qy    479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
          || |||||| ||| ||||||||||||||||||||||||||||||||
Db    739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784



RESULT 7 

US-09-880-107-3739 

; Sequence 3739, Application US/09880107
; Patent No. US20020142981A1
; GENERAL INFORMATION:
; APPLICANT: Home, Darci T.
; APPLICANT: Vockley, Joseph G.
; APPLICANT: Scherf, Uwe
; APPLICANT: Gene Logic, Inc.
; TITLE OF INVENTION: Gene Expression Profiles in Liver Cancer
; FILE REFERENCE: 44921-5028-WO
; CURRENT APPLICATION NUMBER: US/09/880,107
; CURRENT FILING DATE: 2001-06-14
; PRIOR APPLICATION NUMBER: US 60/211,379
; PRIOR FILING DATE: 2000-06-14
; PRIOR APPLICATION NUMBER: US 60/237,054
; PRIOR FILING DATE: 2000-10-02
; NUMBER OF SEQ ID NOS: 3950
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 3739
; LENGTH: 7260
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; OTHER INFORMATION: Genbank Accession No. US20020142981A1 X57025
US-09-880-107-3739 



Query Match 63.9%; Score 334.4; DB 10; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4;

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db    311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db    371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db    431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db    491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          ||||||||||| |||
Db    551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
                 ||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db    566 -------AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||||||||||||| ||||||||||||||  |||| |||||||||||| ||||||
Db    619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678

Qy    421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478
          |||||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||
Db    679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738

Qy    479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
          || |||||| ||| ||||||||||||||||||||||||||||||||
Db    739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784



RESULT 8 

US-09-873-319-707 

; Sequence 707, Application US/09873319A
; Publication No. US20030134324A1
; GENERAL INFORMATION:
; APPLICANT: Munger, William E.
; APPLICANT: Kulkarni, Prakash
; APPLICANT: Getzenberg, Robert H.
; APPLICANT: Waga, Iwao
; APPLICANT: Yamamoto, Jun
; TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic
; TITLE OF INVENTION: Hyperplasia Using Gene Expression Profiles
; FILE REFERENCE: 44921-5029-US
; CURRENT APPLICATION NUMBER: US/09/873,319A
; CURRENT FILING DATE: 2001-06-05
; EARLIER APPLICATION NUMBER: US 60/223,323
; EARLIER FILING DATE: 2000-08-07
; NUMBER OF SEQ ID NOS: 755
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 707
; LENGTH: 7260
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; OTHER INFORMATION: Genbank Accession No. US20030134324A1 X57025
US-09-873-319-707 

Query Match 63.9%; Score 334.4; DB 13; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db    311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db    371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db    431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db    491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          ||||||||||| |||
Db    551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
                 ||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db    566 -------AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||||||||||||| ||||||||||||||  |||| |||||||||||| ||||||
Db    619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678

Qy    421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478
          |||||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||
Db    679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738

Qy    479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
          || |||||| ||| ||||||||||||||||||||||||||||||||
Db    739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784



RESULT 9 

US-09-960-706-1066 

; Sequence 1066, Application US/09960706
; Publication No. US20030134280A1
; GENERAL INFORMATION:
; APPLICANT: Munger, William E.
; TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic Hyperplasia Using
; TITLE OF INVENTION: Gene Expression Profiles
; FILE REFERENCE: 44921-5029-01US
; CURRENT APPLICATION NUMBER: US/09/960,706
; CURRENT FILING DATE: 2001-09-24
; PRIOR APPLICATION NUMBER: 60/223,323
; PRIOR FILING DATE: 2000-08-07
; PRIOR APPLICATION NUMBER: 09/873,319
; PRIOR FILING DATE: 2001-06-05
; NUMBER OF SEQ ID NOS: 1124
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 1066
; LENGTH: 7260
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; OTHER INFORMATION: Genbank Accession No. US20030134280A1 X57025
US-09-960-706-1066 

Query Match 63.9%; Score 334.4; DB 13; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db    311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db    371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db    431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db    491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          ||||||||||| |||
Db    551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
                 ||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db    566 -------AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||||||||||||| ||||||||||||||  |||| |||||||||||| ||||||
Db    619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678

Qy    421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478
          |||||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||
Db    679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738

Qy    479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
          || |||||| ||| ||||||||||||||||||||||||||||||||
Db    739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784



RESULT 10 
US-10-136-639-4 

; Sequence 4, Application US/10136639
; Publication No. US20030072761A1
; GENERAL INFORMATION:
; APPLICANT: LeBowitz, Jonathan
; TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TARGETING PROTEINS ACROSS THE BLOOD BRAIN
; TITLE OF INVENTION: BARRIER
; FILE REFERENCE: SYM-008
; CURRENT APPLICATION NUMBER: US/10/136,639
; CURRENT FILING DATE: 2002-09-06
; PRIOR APPLICATION NUMBER: US 60/329,650
; PRIOR FILING DATE: 2001-10-16
; NUMBER OF SEQ ID NOS: 4
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 4
; LENGTH: 7260
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-136-639-4 

Query Match 63.9%; Score 334.4; DB 15; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy      1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
          |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db    311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy     61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120
          ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db    371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy    121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180
          ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db    431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy    181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240
          || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db    491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy    241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300
          ||||||||||| |||
Db    551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy    301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360
                 ||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db    566 -------AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618

Qy    361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420
          ||||||||||||||||||| ||||||||||||||  |||| |||||||||||| ||||||
Db    619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678

Qy    421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478
          |||||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||
Db    679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738

Qy    479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523
          || |||||| ||| ||||||||||||||||||||||||||||||||
Db    739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784



RESULT 11 
US-10-207-655-54 

; Sequence 54, Application US/10207655
; Publication No. US20030118592A1
; GENERAL INFORMATION:
; APPLICANT: Ledbetter, Jeffrey A.
; APPLICANT: Hayden-Ledbetter, Martha S.
; TITLE OF INVENTION: BINDING DOMAIN-IMMUNOGLOBULIN FUSION PROTEINS
; FILE REFERENCE: 390069.401C1
; CURRENT APPLICATION NUMBER: US/10/207,655
; CURRENT FILING DATE: 2002-07-25
; NUMBER OF SEQ ID NOS: 426
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 54
; LENGTH: 725
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-207-655-54 

Query Match 63.6%; Score 332.8; DB 15; Length 725; 

Best Local Similarity 84.4%; Pred. No. 9.2e-99; 

Matches 444; Conservative 0; Mismatches 27; Indels 55; Gaps 4;

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 156 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 215 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 216 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 275 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I II I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 276 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 335

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

II I I II I I I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 336 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 395 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

I I II I I I I I I I III 

Db 396 ATGCCCAAGACCCAG 410

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 411 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 463

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 464 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 523



Qy 421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I II I I I I I I 

Db 524 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 583

Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523

II I I I I I I III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 584 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 629



RESULT 12 
US-10-251-661-7 

; Sequence 7, Application US/10251661 
; Publication No. US20030166555A1 
; GENERAL INFORMATION: 
; APPLICANT: Alberini, Cristina M. 
; APPLICANT: Bear, Mark F. 

; TITLE OF INVENTION: Methods and Compositions for Regulating 

; TITLE OF INVENTION: Memory Consolidation 

; FILE REFERENCE: 3499.1001-003 

; CURRENT APPLICATION NUMBER: US/10/251,661

; CURRENT FILING DATE: 2002-09-20 

; PRIOR APPLICATION NUMBER: 60/193,614 

; PRIOR FILING DATE: 2000-03-31 

; PRIOR APPLICATION NUMBER: PCT/US01/10661 

; PRIOR FILING DATE: 2001-04-02 

; NUMBER OF SEQ ID NOS : 12 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 7 
; LENGTH: 612 
; TYPE: DNA

; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (151)...(564)
US-10-251-661-7 



Query Match 52.3%; Score 273.6; DB 13; Length 612; 

Best Local Similarity 83.0%; Pred. No. 2.6e-79; 

Matches 347; Conservative 0; Mismatches 19; Indels 52; Gaps 1; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 247 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 306 



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 307 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 366 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 367 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 426



Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

II I I I I I I I I I I I I I I II III I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I

Db 427 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 486



Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

II I I I I I I I II III 

Db 487 ATGCCCAAGACCCAG 501

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 502 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 554

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTT 418

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTT 612



RESULT 13 
US-09-852-261-11 

; Sequence 11, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI, GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/09/852,261

; CURRENT FILING DATE: 2001-05-10
; PRIOR APPLICATION NUMBER: GB 0011278.9 
; PRIOR FILING DATE: 2000-05-10 
; NUMBER OF SEQ ID NOS : 14 
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 11 
; LENGTH: 487 
; TYPE: DNA 

; ORGANISM: Rattus sp.
US-09-852-261-11 

Query Match 50.1%; Score 262; DB 9; Length 487; 

Best Local Similarity 74.7%; Pred. No. 1.5e-75; 

Matches 396; Conservative 0; Mismatches 75; Indels 59; Gaps 3; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I III 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 



Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

I I I I I I I I I I I I I I I

Db 241 ATGCCCAAGACTCAG 255

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

I I I I I I I I I M I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M

Db 256 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 308

Qy 361

I M I I I I I I I I I I I I I I I I I I I I M I I I I I I

Db 309

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATC 474

M I II II I I I I I I I I I I I I I I I I I I I I I I II

Db 369 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 428

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 429 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 478



RESULT 14 
US-09-852-261-9 

; Sequence 9, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI, GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/09/852,261

; CURRENT FILING DATE: 2001-05-10 

; PRIOR APPLICATION NUMBER: GB 0011278.9 

; PRIOR FILING DATE: 2000-05-10 

; NUMBER OF SEQ ID NOS : 14 

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 9 

; LENGTH: 318

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-09-852-261-9 

Query Match 45.4%; Score 237.6; DB 9; Length 318; 

Best Local Similarity 94.6%; Pred. No. 1.3e-67; 

Matches 246; Conservative 0; Mismatches 14; Indels 0; Gaps 0;

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

I | | M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I II I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I II I I I I I I I I I II I

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I || | | | I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

|| I I I I I I I I I I I I I I M Ml I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240



Qy 241 ATGCCCAAGACTCAGAAGTA 260 

I I I I I I I I I I I I I I I I I I 

Db 241 ATGCCCAAGACCCAGAAGGA 260



RESULT 15 
US-10-238-114-1 

Sequence 1, Application US/10238114 
Publication No. US20030100073A1 
GENERAL INFORMATION: 
APPLICANT: Merial 

APPLICANT: ANDREONI, Christine Michele

TITLE OF INVENTION: IGF-1 AS FELINE VACCINE ADJUVANT, IN PARTICULAR AGAINST 
FELINE RETROVIRUS 

FILE REFERENCE: 454313-3165.1 
CURRENT APPLICATION NUMBER: US/10/238,114 
CURRENT FILING DATE: 2002-09-10 
PRIOR APPLICATION NUMBER: FR 01 11736 
PRIOR FILING DATE: 2001-09-11 
PRIOR APPLICATION NUMBER: US 60/318,666 
PRIOR FILING DATE: 2001-09-12 
NUMBER OF SEQ ID NOS: 20
SOFTWARE: PatentIn version 3.1
SEQ ID NO 1 
LENGTH: 462 
TYPE: DNA 

ORGANISM: Felis catus 
US-10-238-114-1 

Query Match 43.6%; Score 228; DB 15; Length 462;

Best Local Similarity 92.3%; Pred. No. 2.3e-64; 

Matches 240; Conservative 0; Mismatches 20; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

| | I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I

Db 145 GGACCAGAGACGCTCTGTGGGGCTGAGTTGGTGGACGCTCTTCAGTTCGTGTGTGGAGAC 204

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I II I I I I I I I I I I I II I I I I I I

Db 205 AGGGGTTTTTATTTCAACAAGCCCACGGGGTATGGCTCCAGCAGTCGGAGGGCACCTCAG 264

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I M I I I I I I I I I I I INI I I I I I I I I I

Db 265 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGCGGCTAGAGATGTAC 324

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

1 I I I I I I I I I 1 I I I I I I II III I I II I I I I I II I I I I I I I II I I I I I I I I III

Db 325 TGTGCACCCCTCAAGCCTGCCAAGTCTGCCCGCTCAGTCCGTGCTCAGCGCCACACTGAC 384

Qy 241

I II I I II I I I I I II I II I

Db 385



Search completed: December 13, 2003, 11:56:48 
Job time : 235.512 secs



